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PREFACE 


This  volume  contains  ihu-  technical  proceedings  of  the  Workshop  on  the  Mathe- 
matics of  Profile  Inversion,  which  was  held  at  the  NASA/ Ames  Research  Center.  Moffett 
Field,  California,  during  July  12  through  July  16.  1971.  Ninety-six  invited  scientists 
from  nine  countries  representing  several  scientific  disciplines  participated  in  the  work- 
shop. Their  names  and  affiliations  are  given  in  an  appendix. 

T<>  better  understand  the  purpose  of  the  workshop  and  the  contents  of  the  proceed- 
ings, it  is  desirable  to  define  broadly,  what  is  meant  here  by  profile  inversion.  In  many 
areas  of  physical  science,  fundamental  functions  that  define  or  describe  a physical  medium 
are  computed  from  experimental  data  obtained  through  “remote  sensing”  of  that  medium, 
r.s  opposed  to  direct,  or  in  situ , measurement.  The  fundamental  function  is  related  to  the 
experimental  data  through  a differential  equation  (partial  or  ordinary),  and  the  reconstruc- 
tion of  the  basic  function  or  some  of  its  properties  constitutes  the  general  class  of  inverse 
problems.  The  term  inverse  distinguishes  these  problems  from  the  “direct”  problem 
where,  given  the  physical  medium  - its  fundamental  function  - one  calculates  hypothe- 
tical experimental  results.  The  direct  problem  is  normally  more  amenable  to  solution 
than  the  inverse  problem,  due  mainly  to  the  absence  of  both  data  limitations  and  experi- 
mental error  in  the  former.  In  most  inversion  applications  it  is  important  to  deduce  the 
spatial  variation  (radial,  height,  etc.)  cf  the  fundamental  function  from  the  experimental 
data;  hence,  the  inclusion  of  the  term  profile. 

Some  time  ago.  Dr.  Mario  Grossi  of  Raytheon  Company  was  pursuing  theoretical 
research  concerned  with  the  inversion  of  radio  data  to  determine  the  radial  refractivity 
profile  of  an  atmosphere  through  which  the  radio  waves  had  propagated  - the  radio- 
occultation  experiment,  now  performed  successfully  on  several  U.S.  planetary  probes.  In 
this  experiment,  coherent  radio  waves  are  transmitted  from  the  spacecraft  toward  earth 
and  recorded  there.  As  the  waves  traverse  the  planetary  atmosphere  and  iot  csphere  they 
are  dispersed  (and  scattered  and  absorbed)  by  those  media.  The  inversion  problem  is  to 
compute  the  refractive  ind'x  distribution  (hence  pressure,  temperature,  and  electron  den- 
sity) as  a function  cf  radius  from  the  center  of  the  planet  using  the  recorded  data  along 
with  proper  algorithms. 


Inversion  algorithms  for  this  experiment  had  been  under  development  for  several 
years  by  the  Stanford  University  and  Jet  Propulsion  Laboratory  scientists  involved,  but  a 
very  significant  contribution  was  published  in  the  March  1968  Journal  of  Geophysical 
Research  by  two  seismologists  . R.  A.  Phinney  of  Princeton  University  and  D.  L.  Anderson 
of  the  California  Institute  of  Technology.  They  showed  that  the  radio-occultation  inver- 
sion problem  is  identical,  in  principle,  to  the  problem  of  determining  the  variation  of 
seismic  velocities  in  the  earth  from  the  observed  travel  times  of  seismic  body  waves  - what 
seismologists  call  the  geophysical  inverse  problem . In  applying  the  geophysical  inverse 
problem  algorithms  to  the  radio-occultation  inversion  problem.  Dr.  Grossi  recognized  that 
in  several  other  branches  of  physics,  inversion  techniques  are  mandatory  for  determination 
of  the  intended  scientific  results.  He  subsequently  suggested  to  the  National  Aeronautics 
and  Space  Administration  that  all  scientists  concerned  could  mutually  benefit  from  a 
symposium  at  which  the  various  individual  inversion  techniques  were  outlined,  discussed, 
and  explored  with  respect  to  their  interdisciplinary  applicability.  NASA  concurred  and 
requested  the  undersigned  to  organize  the  undertaking. 

To  ensure  adequate  representation  from  each  of  the  major  scientific  disciplines 
involved,  a Steering  Group  was  formed:  M D.  Grossi,  Raytheon;  J.  E Jackson,  Goddard 
Space  Right  Center;  H.  E.  Moses,  Air  Force  Cambridge  Research  Labs;  R.  G.  Newton, 
Indiana  University;  W.  Nordberg,  Goddard  Space  Flight  Center;  A.  M.  Peterson,  Stanford 
University;  R.  A.  Phinney,  Princeton  University;  and  J.  Shmoys,  Polytechnic  Institute  of 
Brooklyn.  Each  was  responsible  for  organizing  and  chairing  a single,  half-day  session  de- 
voted to  his  specialty.  This  responsibility  included  specifying  speakers  and  their  contribu- 
tions, as  well  as  recommending  other  participants.  Their  efforts  led  to  seven  formal  sessions 
devoted  to  the  following  disciplines:  passive  atmospheric  sounding,  active  atmospheric 
sounding,  occultation  measurements,  ionospheric  sounding,  particle  scattering,  electromag- 
netic scattering,  and  seismology.  Some  37  invited  papers  were  presented  and  discussed. 
These  formal  sessions  were  followed  by  two  informal  workshop  sessions  devoted  to  short 
voluntary  contributions  and  a panel  discussion  exploring,  in  depth,  the  interdisciplinary 
aspects  of  the  various  algorithms. 

For  lack  of  time,  several  scientific  disciplines  involving  inversion  problems  were  not 
formally  structured  into  the  program.  These  include  atmospheric  sounding  by  sonic  booms 
and  stellar  occultation,  structural  mechanics,  plasma  diagnostics,  lunar  conductivity  measure- 
ments, underwater  soundings,  biomedical  applications,  and  others.  However,  most  of  these 
areas  were  represented  by  participants  in  the  audience,  several  of  whom  made  short  contri- 
butions during  the  last  sessions. 

The  organization  of  these  proceedings  generally  follows  that  of  the  workshop. 

Seven  chapters  are  devoted  to  each  of  the  major  disciplines.  Written  accounts  of  the 
invited  papers  are  contained  therein,  each  followed  by  edited  accounts  of  the  subsequent 
discussions.  Each  chapter  concludes  with  a comprehensive  bibliography,  including  the 
references  from  the  individual  papers.  Chapter  8 contains  certain  of  the  extended  volun- 
tary contributions  offered  during  the  workshop  sessions,  as  well  as  other  short,  miscella- 
neous, but  important,  contributions  offered  during  the  panel  discussions.  Chapter  9 
contains  the  workshop  summary  papers. 

The  workshop  was  unique  in  that  neither  the  scientific  methods  nor  the  scientific 
results  were  emphasized  or  critiqued;  the  mathematical  techniques  and  algorithms  lead- 
ing to  those  results  were.  Of  course,  it  was  recognized  that  often  the  nature  and  form  of 
the  physical  profiles  lead  naturally  to  the  algorithms  involved,  and  thus  are  introduced 
invariably  in  the  papers,  but  the  scientific  meanings  of  these  profiles  are  not  important 


tv 


here.  In  organizing  the  workshop,  we  emphasized  scientific  areas  in  which  profile  inversion 
is  an  indispensable  part  of  experimentation.  However,  the  inversion  algorithms  run  the 
gamut  from  being  almost  nonexistent  to  being  highly  sophisticated.  1 feel  safe  in  saying 
that  each  participating  scientist  benefited  from  the  workshop  in  terms  of  the  primary  goal, 
learning  of  inversion  algorithms  developed  in  one  discipline  that  has  application  in  his  own. 
An  added  bonus,  perhaps  not  clearly  recognized  in  the  organizational  stages,  was  that  de- 
tailed exploration  of  the  algorithms  led  to  conceptual  thinking  for  planning  future  experi- 
ments. 

As  with  any  undertaking  of  this  scope,  there  are  many  people  who  contributed  to  the 
success  of  the  workshop.  The  members  of  the  Steering  Committee,  mentioned  above,  did 
an  outstanding  job  in  organizing  and  conducting  their  sessions  and  in  attracting  leading 
scientists.  Special  thanks  are  due  W.  Smith  who,  substituting  for  W.  Nordberg  at  the  last 
moment,  did  an  excellent  job  in  chairing  the  session  on  passive  atmospheric  sounding. 

Dr.  P.  C.  Sabatier  should  be  singled  out  for  his  illuminating  contributions  and  his  heroic 
summary  paper.  Several  Ames  personnel  helped  make  the  Workshop  and  the  publication 
of  this  volume  quite  a bit  easier  and  ie:s  frustrating:  Zelda  Ballantine,  C.  Duller,  R.  Maines, 
W.  Walling,  C.  Frost,  Carol  Tinling,  and  B.  Balandis.  Finally,  and  certainly  not  least, 

Dr.  E.  R.  Schmerling,  NASA  Headquarters,  provided  the  impetus  and  funding  that  made 
the  workshop  possible. 


Lawrence  Colin 
Moffett  Field,  California 

August  1972 
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1.  PASSIVE  ATMOSPHERIC  SOUNDING 


The  subject  of  this  chapter  is  the  ground-based,  airborne,  or  rpaceborne  measurement  of  emitted  terrestrial 
(planetary)  or  reflected  solar  radiation.  Radian,  e measurements  are  made  as  a function  of  wavelength  (visible, 
ultraviolet,  infrared),  look-angle,  and  atmospheric  optical  depth,  and  are  inverted  to  yield  the  vertical  structure  of 
several  atmospheric  parameters,  including  temperature,  ozone,  water  vapor,  and  trace  constituents.  Regardless  of 
the  measurement  platform  and  wavelength  band,  the  mathematical  prob!;:.i  is  ider  tical:  inversion  of  the  integral 
equation  of  radiative  transfer.  Sophisticated  statistical  and  nonstatistical,  linear  and  nonlinear  inversion  techniques 
are  discussed  and  compared.  The  difficulties  with  the  inversion  problem  are  both  mathematically  and  numerically 
oriented,  and  the  ultimate  choice  of  a “best”  method  for  a particular  application  may  not  be  based  or  accuracy  but 
is  restricted  normally  by  available  resources,  quantity  of  experimental  data,  and  a priori  knowledge. 

The  chapter  concludes  with  a comprehensive  bibliography  comprising  references  noted  in  the  text  as  well 
as  other  relevant  papers. 

W.  Nordberg  organized  the  session  devoted  to  passive  atmospheric  sounding;  however,  W.  Smith  chaired  the 
session  and  led  the  discussion  periods. 
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A REVIEW  OF  SOME  ASPECTS  OF  INFERRING  THE  OZONE  PROFILE 


BY  INVERSION  OF  ULTRAVIOLET  RADIANCE  MEASUREMENTS 

Carlton  L.  Mateer 

Atmospheric  Environment  Service,  Toronto,  Ontario 
N 7 3 *"  1 ^ r ^ ABSTRACT 


The  mathematical  inversion  of  light-scattering  observations  to  obtain  the  atmospheric  ozone  profile  is  discussed 
in  terms  of  the  filtering  properties  of  the  physical  and  mathematical  processes  for  different  spatial  scales.  Within  this 
context,  it  is  shown  that  the  physical  process  of  scattering,  whether  in  diffuse  transmission  or  in  diffuse  reflection, 
acts  as  a low-pass  filter;  which  transfers  large-scale  profile  information  efficiently  to  radiance  observations  but  very 
strongly  attenuates  small-  or  fine-scale  profile  information.  To  avoid  domination  of  the  mathematical  inversion  by 
the  random  error  of  radiance  measurements,  an  equivalent  spatial-scale  filtering  in  the  inversion  procedure  is 
- '.sential.  Methods  of  inversion  used  on  traditional  ground-based  Umkehr  measurements  and  on  satellite  measure- 
ments, and  the  accuracy  and  ut.lity  of  the  inversion  results,  are  reviewed.  The  available  evidence  suggests  that 
mathematical  inversion  to  obtain  the  low-level  ozone  profile  below  25  to  30  km  is  either  inferior  to  or  no  better 
than  the  statistical  estimation  of  the  profile  using  total  ozone  as  predictor.  However,  inversion  profiles  for  high-level 
ozone  above  25  to  30  km  appear  to  have  moderately  good  accuracy. 


INTRODUCTION 


Indirect  inferences  about  atmospheric  structure  were  made  nearly  a century  ago  from  the  unusual  twilight 
optical  phenomena  observed  following  the  famous  Krakatoa  volcanic  eruption.  The  existence  of  stratospheric  'u  si 
layers  was  inferred  [Russell  and  Archibald,  1888]  from  these  observations.  According  to  Mi. « [1952],  the 
existence  of  high  temperatures  in  the  upper  stratosphere  was  postulated  early  in  the  present  century  from  observa- 
tions of  anomalous  sound  propagation.  The  first  truly  quantitative  indirect  profiling  probably  came  with  the  intro- 
duction in  1926  of  radio  methods  [Mitra,  1952]  for  determining  ionospheric  electron  density  profiles.  Several  years 
later,  measurements  of  diffusely  transmitted  solar  ultraviolet  radiation  were  used  to  infer  the  main  features  of  the 
atmospheric  ozone  profile  [Gotz  et  al. , 1934] . In  their  classic  paper,  Gotz  et  al.  gave  the  fust  hints  about  the 
fundamental  problems  that  exist  in  the  inversion  of  radiance  measurements  to  obtain  information  on  atmospheric 
profiles.  However,  although  quantitative  profile  inversion  is  at  least  40  years  old  and  has  undergone  very  rapid 
development  in  the  past  decade,  it  remains  a rapidly  developing  area  of  scientific  interest. 

This  discussion  is  restricted  to  the  inference  of  the  vertical  ozone  profile  by  the  mathematical  inversion  of 
radiance  measurements  of  solar  ultraviolet  radiation  scattered  by  the  terrestrial  atmosphere.  The  nature  of  the  ozone 
pr-~  ~le  in  the  lower  atmosphere  is  illustrated  in  figure  !,  which  statistically  summarizes  over  500  direct  profile 
measurements  obtained  by  balloon  sounding  at  Boulder,  Colarado  [Diitsch,  1966;  Diitsch  et  al. , 1970] . The  curves 
are  for  different  total  amounts  of  ozone  given  in  atm-cm.  The  main  features  of  the  ozone  profile  are  the  pronounced 
maximum  occurring  at  22-24  km,  the  tendency  to  a secondary  maximum  in  the  lower  stratosphere  near  1 3-14  km 
when  the  total  ozone  amount  is  high,  and  the  approximately  exponential  decrease  of  ozone  with  height  above  about 
30  km. 

The  inference  of  the  ozone  profile  from  light  scattering  measurements  may  be  attempted  from  ground 
(diffuse  transmission)  or  satellite  (diffuse  reflection)  measurements.  In  either  case,  the  radiation  scattered  by  the 
terrestrial  atmosphere  comes  mostly  from  a fairly  well-defined  layer,  the  effective  scattering  layer.  The  vertical 
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position  of  this  layer  depends  on  the  ozone  absorption 
coefficient  at  the  wavelength  under  consideration,  the 
zenith  angle  of  the  sun,  ihe  direction  of  observation, 
and  the  vertical  distribution  of  ozone  in  the  atmosphere 
[Da\  e and  Mateer,  1967],  For  satellite  measurements 
in  the  nadir  direction,  these  scattering  layers  are  shown 
in  figure  2,  which  applies  to  a solar  zenith  angle  of  60° 
and  a total  ozone  amoun*  of  0.336  atm-cm.  The  curves 
show  the  relative  amount  of  energy  backscattered  at 
serious  levels  in  the  atmosphere  for  several  wavelengths, 
with  the  ozone  absorption  coefficient  varying  from 
about  300  atm-cm-1  at  25S5A  to  0.05  atm-cm-1  at 
3398A.  The  area  between  each  curve  and  the  y axis  is 
directly  proportional  to  the  backscattered  radiance 
observed  at  the  satellite,  and  all  orders  of  molecular 
scattering  have  been  included  in  the  calculation.  Similai 
curves  apply  to  the  case  of  diffuse  transmission,  as 
illustrated  in  figure  3,  where  the  vertical  “scanning” 
effect  is  achieved  by  allowing  the  solar  zenith  angle  to 
vary  for  2 or  3 hr  near  sunset  or  sunrise.  Although  the 
curves  in  figure  3 were  obtained  with  single  scattering 
calculations,  multiple  scattering  curves  exhibit  the  same 
kind  of  scanning  effect. 


SOME  PROPERTIES  OF  THE  INTEGRAL  EQUATION 


50  100  SO  200 

OZONE  PARTIAL  PRESSURE  (Mmb> 


The  inference  of  atmospheric  profiles  from 
radiance  measurements  usually  involves  the  inversion  of 
an  integral  equation  of  the  form 


Figure  1.-  Average  vertical  ozone  profiles  at  Boulder. 
Colorado,  for  different  total  ozone  amounts. 


J/f(x,y)/t.v)dx  = *00  0) 

where  the  gCv)  are  radiance  measurements  at  various  values  of  y,K(x,y)  is  the  so-called  "kernel,”  and  f(x ) is  the 
unknown  atmospheric  profile.  In  somt  cases,  the  equation  may  be  of  the  form 

f*l/Tx);x,.vMx)d*-*0''  (2) 


j*  LtoO;  X,y\ Ax 


In  all  cases,  these  are  reducible  to  a matrix  equation 


Af-g 
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or,  if  Af0  * g0,  Af  ■ f - f0  and  Ag  - g - go 
where  fo.go  reqresent  some  standard  or  average 
conditions,  then 


AAf  = Ag  (5) 

Throughout  this  paper,  boldface  symbols  indicate 
matrixes  or  vectors;  other  letter  symbols  refer  to 
scalars.  In  profile  inversion,  the  goal  is  to  estimate 
the  unknown  profile  Af  from  radiance  measure- 
ments Ag,  which  contain  random  and  nonrandom 
errors  of  measurement.  Other  errors  generally 
inherent  in  (S)  include  physical  processes  that  may 
have  been  neglected  in  the  mathematical  model  of 
the  atmosphere  described  by  ( 1),  (2),  or  (3)  and  the 
linearization  of  what  may  be  a nonlinear  equation 
that  is  (2)  or  (3).  There  may  also  be  small  quadrature 
errors  in  (5). 

The  fundamental  problems  of  inverting  equa- 
tions such  as  (5),  when  they  represent  real  atmo- 
spheric physical  processes  and  involve  random  errors, 
have  been  discussed  extensively  during  the  past 
decade  by  Twomey  [1963, 1965, 1966] , Twomey 
and  Howell  [1963, 1967]  ^Matter  [1964, 1965] , and 
others.  The  discussion  here  is  drawn  primarily  from 
their  work. 


Figure  2.  Contribution  to  the  nadir-direction  radiance  by 
backscattering  at  various  levels  in  the  atmosphere  for 
the  case  of  satellite  observations. 


For  real  situations,  the  problems  arise  first  from 
the  smoothness  and  large-scale  nature  of  the  kernels, 
directly  comparable  to  the  curves  of  figure  2 for  (3); 
and  second  from  the  random  errors  in  (5).  In  general, 
the  vector  Ag  will  comprise  n measurements  (say 
4 to  12).  However,  so  that  the  mathematical  quadra- 
ture will  not  add  appreciably  to  the  random  error  in 
(5),  the  vector  Af  will  generally  comprise  specifications  of  A/  at  m (>«)  points  (say  20  to  40,  but  perhaps  100 
or  more).  Immediately,  then,  the  system  of  equations  is  underdetermined,  admitting  an  infinite  manifold  of  solu- 
tions for  Af  that  will  satisfy  (5)  exactly.  Frequently,  this  difficulty  is  avoided  by  constraining  the  solution  to  some 
ad  hoc  form  so  that  m<n  unknowns  are  sought  in  the  solution.  At  this  point,  however,  there  is  no  need  to  artifi- 
cially restrict  m ; indeed,  it  is  convenient  to  suppose  that  m is  sufficiently  large  that  Af  rnay  adequately  describe 
all  features  (including  the  fine  structure)  commonly  found  in  atmospheric  profiles. 

it  is  instructive  to  think  of  A as  a transformation  matrix  that  transforms  from  the  fx  “profile”  plane  into 
the  gy  “observation”  plane  and  to  examine  the  amplitude  transmission  of  different  spatial  scales  from  the  fx  to 
the  gy  plane.  In  doing  this,  it  is  convenient  to  use  the  eigenvectors  that  specify  the  “principal  axes”  pertinent  to 
the  rows  of  A.  We  consider  the  standard  matrix  algebra  equation 


W*  A* AW  « A 


(6) 


where  the  (orthonormal)  eigenvectors  are  the  columns  of  W,  the  asterisk  denotes  matrix  transposition,  and  A is 
the  completely  diagonal  matrix  having  the  eigenvalues  of  A*  A on  its  diagonal  and  zeroes  elsewhere.  Note  that  if 
m>n , there  are  at  most  n nonzero  eigenvalues  and,  hence,  at  most  n nontrivial  eigenvectors. 
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HEIGHT  1 (km) 


Figurt  3.-  Contribution  X(t)  to  the  zenith-direction  radiance  by  primary  Mattering  at  varioui 
levels  in  the  atmoaphera  for  the  cau  of  ground-bated  obeervatson*  [Milter,  1964) . 
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TABLE  1.  EIGENVALUES  AND  EIGENVECTORS  FOR  A TYPICAL 
MATRIX  |after  Matter,  1965] 


Vector 

Eigenvalue1 

1 

2 

3 

Eigenvector  points 
4 5 6 

7 

8 

9 

1 

4 68 

(1) 

-0.180 

-0.471 

-0.462 

-0  405 

-0.321 

-0.125 

0.136 

0.308 

0 368 

2 

1 55 

(1) 

-0.087 

-C.022 

0 160 

0.262 

0.374 

0.441 

0.515 

0.399 

0.37! 

3 

4.15 

(0) 

0.334 

0.472 

0.248 

-0.004 

-0.231 

-0.356 

-0.163 

0 334 

0.532 

4 

9 17 

(-1) 

0.682 

0.187 

-0.219 

-0.297 

-0.166 

0.227 

0.468 

-0.020 

-0.254 

5 

1 16 

(-1) 

-0.493 

0.282 

0.263 

3.045 

-0  391 

-0.266 

0.528 

0.096 

-0.306 

6 

947 

(-3) 

0 263 

-0  299 

-0.082 

0.288 

-0.347 

-0.651 

0.180 

0.310 

-0.283 

7 

1.58 

(-3) 

-0  236 

0 456 

-0.273 

-0.489 

0.521 

-0.031 

-0.156 

0.311 

-0.171 

X 

341 

(-4) 

-0.122 

0.378 

-0.698 

0.510 

-0  064 

-0.080 

0.118 

-0.211 

0.159 

9 

1.03 

(-5) 

0.009 

0.006 

-0.115 

0.306 

-0.356 

0.331 

-0.354 

0.622 

-0.388 

1 Number  in  parentheses  is  power  of  10  by  which  preceding  number  is  to  be  multiplied. 


We  expand  the  profile  Af  in  terms  of  the  eigenvectors;  thus 


Af  = Wb 


(7) 


where  b is  a column  vector  of  coefficients.  This  expansion  is  perfectly  general 

expanded  exactly  in  terms  of  m orthonormal  vecto'S.  Moreover,  the  variance 
m , 

given  by  2 b : , since  W*W  ■ 1 , where  I is  the  identity  matrix.  Table  1 
/"I 


in  the  sense  that  Af  can  be 
m , 

Vf  ■ 2 A ft  , of  A / is  alsc 
J i*i 

lists  eigenvalues  and  eigenvectors 


for  a case  where  m * 9 ; the  eigenvalues  are  listed  in  order  of  decreasing  magnitude  while  the  corresponding 
eigenvectors  are  in  the  appropriate  row  of  the  table.  Note  that  the  low-order  eigenvectors  correspond  to  the  large 
spatial  scales  (low  frequencies)  in  the  profile  plane,  whereas  the  higher-ord.,  eigenvectors,  which  exhibit  frequent 
sign  changes,  correspond  to  small  spatial  scales  (fine  structure,  high  frequencies).  Therefore,  the  low-order  eigen- 
vectors will  explain  the  variance  in  Af  that  arises  from  the  larger  spatial  scales  in  the  fx  plane,  while  the  high- 
order  eigenvectors  will  explain  the  variance  in  A / that  arises  from  the  small  spatial  scales. 


If  we  now  substitute  (7)  in  (5)  and  Mculate  Vg  , the  variance  in  the  gy  observation  plane,  we  have 


Vg  - Ag*Ag  « b*W*A*A’Vb  - b*Ab 


(8) 


where  Ag*.  b*  are  row  vectors.  The  above  result  states  that  the  fth  spatial  scale  is  amplified  by  X;  in  the  trans- 
formation from  ttu.  profile  plane  to  the  observation  plane.  Thus  the  fine  structure  in  the  profile  plane,  which  is 
described  by  the  high-order  eigenvectors  having  very  small  associated  eigenvalues,  is  very  strongly  reduced  in  ampli- 
tude in  the  observation  plane.  Indeed,  when  m>n , the  very  fine  structure  in  the  profile  plane  will  be  explained  by 
very  high-order  eigenvectors  for  which  the  X/  are  identically  zero  (n  </<m  ) and  will  not  appear  at  all  in  the 
observation  plane  This  information  about  the  profile  is  completely  inaccessible,  even  in  n error-free  radiance 
measurements  [Twomey  and  Howell,  1967).  However,  for  practical  purposes,  the  fine  structure  information  con- 
tent of  the  observations  is  so  reduced  in  amplitude  that  it  is  indistinguishable  from  the  random  error  present  in  Ag. 
These  results  are  direct  consequences  of  the  smoothness  and  large  scale  of  the  kernel  functions  of  the  basic  integral 
equations.  As  suggested  by  Twomey  [1965] , the  physical  process  represented  by  the  transformation  matrix  A is 
really  a low-pass  filter,  transmitting  low-frequency  (large-scale)  information  quite  readily,  but  providing  strong 
attenuation  to  high-frequency  (fine-structure)  information. 
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There  are  two  obvious  consequences  of  the  above  result.  Fiist,  if  one  examines  many  observations  of  Ag,  as 
Mateer  [1964, 1965]  did  for  Umkehr  observations,  most  of  the  variance  can  be  explained  by  large-scale  components, 
suggesting  that  there  were,  at  most,  four  lii.early  independent  nieces  of  information  in  traditional  Umkehr  observa- 
tions. An  equivalent  result  is,  for  example,  that  of  twelve  observations  in  the  same  Ag  vector,  eight  could  be  pre- 
dicted from  the  remaining  four  within  the  observational  error  [Twomey,  1966] . 

The  second  and  more  important  consequence  is  that  when  one  attempts  to  recover  Af  from  observations  of 
Ag  by  mathematical  inversion  of  (5),  the  reciprocal  of  X,-  enters  into  the  picture.  Substituting  (7)  in  (5)  and  solving 

for  b by  least  squares  (restricting  b to  m '<n  elements,  where  m'  is  the  number  of  nonzero  eigenvalues),  we 
have 

b = A-1  W* A* Ag  (9) 

where  b is  the  estimate  of  b.  Therefore,  since  b/aXJ1  , the  fine  scale  in  Ag  (i.e.,  the  random  error)  is  greatly 
amplified  in  the  inversion  by  reason  of  the  very  small  high-order  eigenvalues,  and  the  random  observational  errors  in 
Ag  dominate  the  inversion  procedure.  Gearly.  some  analog  to  the  natural  filtering  inherent  in  (5)  is  an  essential 
ingredient  of  any  meaningful  mathematical  inversion  method. 

An  obvious  method  of  achieving  this  filtering  is  to  restrict  b to  m"  elements,  where  \m"  is  the  smallest 
eigenvalue  for  which  the  corresponding  eigenvector  is  relatively  uncontaminated  by  measurement  errors.  It  is  very 
nearly  equivalent  to  one  of  the  methods  proposed  by  Twomey  [1963]  wherein  the  inversion  solution  is  constrained 
to  be  a minimum  departure  from  fa  [see  also  Philips,  1962] . This  solution  is 


Af  = (A*A  + >1)"*'  A*Ag  (10) 

where  7 is  a Lagrangian  multiplier  whose  magnitude  is  determined  in  practice  by  trial  and  error,  bu.  is  implicity  set 
by  the  error  in  Ag  and  the  natural  bounds  on  Af.  It  can  easily  be  shown  that  the  following  forms  are  identical  to 
Eq.  (10),  namely 


Af  = Wb  = W(A  + 7l)~,W*A*Ag 


O'.) 


TABLE  2.-  SOLUTION  CONTRIBUTION  BY  f ACH  EIGENVECTOR  FOR  AROSA  UMKEHR 
OF  21  MARCH  1962.  TOTAL  OZONE  IS  0.40?  ATM-CM.  [Mateer,  1965] 


Eigen- 
vector Vector  Explained  Solution  contribution  for  each  eigenvector 

number  coefficient  variance  Residual  layer-mean  partial  pressures  («imb) 


1 

bi 

X/6/ 

variance 

1 

2 

3 

4 

5 

6 

7 

8 

9 

Standard  distribution 

204.4 

23 

42 

84 

133 

134 

95 

53 

20 

7 

1 

0.225 

14 

202  0 

0 

-3 

-2 

-2 

-1 

0 

0 

0 

0 

2 

2.145 

71.3 

130.7 

-2 

-1 

8 

10 

11 

8 

7 

3 

1 

3 

5.433 

122.5 

8.2 

22 

77 

32 

0 

-18 

-17 

-5 

5 

4 

4 

-2.590 

6.1 

2.1 

-21 

-14 

14 

14 

6 

-5 

-7 

0 

1 

5 

1.21a 

0.2 

1.9 

-7 

10 

8 

-1 

-7 

-3 

4 

0 

-1 

6 

0.987 

0.005 

1.9 

3 

-9 

-2 

5 

5 

-6 

1 

1 

0 

7 

-15.49 

0.4 

44 

-212 

101 

136 

-113 

4 

14 

4 

i 

8 

33.87 

0.4 

1.1 

-SC 

384 

-568 

311 

-30 

-24 

24 

-21 

~ 

9 

-43.79 

0.05 

1.0 

-5 

-8 

-341 

218 

-130 

91 

-82 

25 

Final  solution  (four  vectors) 

22 

101 

136 

i:; 

81 

46 

28 

12 

f 

’ 7 

7- 

* 


TABLE  3 - SOLUTION  CONTRIBUTION  BY  EACH  EIGENVECTOR  USING  TWOMEY'S  METHOD 
WITH  7 = 0 5 FOR  AROSA  UMKEHR  OF  21  MARCH  1962.  TOTAL  OZONE  IS  0.403  ATM-CM 

[Matter,  1965) 


Eigen-  ' 
vector 
number, 

i 

Vector 

coefficient. 

Explained 
variance 
(X,  + y)l>i 

Residual 

variance 

1 

Solution  contribution  for  each  eigenvector 
layer-mean  partial  pressures  ( umb ) 

2 3 4 5 6 7 8 

9 

Standard  distribution 

204.4 

23 

42 

84 

133 

134 

95 

53 

20 

7 

1 

0.224 

2.4 

202.0 

0 

-3 

-2 

-2 

-1 

0 

0 

0 

0 

2 

2.078 

69.1 

132.9 

-2 

1 

8 

10 

11 

8 

6 

2 

1 

3 

4.849 

109.3 

23.6 

19 

69 

29 

0 

-16 

-15 

-5 

5 

4 

4 

-1.676 

4.0 

19.6 

-14 

-9 

9 

9 

4 

-3 

-5 

0 

1 

5 

.229 

.03 

19.6 

-1 

2 

1 

0 

1 

0 

1 

0 

0 

6 

.018 

0 

19.6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

7 

-.049 

0 

19.6 

0 

-1 

0 

0 

0 

0 

0 

0 

0 

8 

.023 

0 

19.6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

9 

.008 

0 

19.6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Final  solution 

25 

101 

129 

150 

131 

85 

50 

27 

13 

Af  = A*(AA*  + yl)-1  Ag 


02) 


Equation  (11),  where  b/#(Xj  + 7)-1  , is  the  filtered  analog  of  (9).  It  is  clear  that  the  magnitude  of  y controls  the 
filtering  effectiveness  of  the  inversion.  We  choose  y sufficiently  small  that  the  large-scale  features  of  Af  are  trans- 
mitted essentially  undamped,  but  sufficiently  large  to  severely  attenuate  the  fine-structure  features  that  would  other- 
wise appear  in  the  solution.  If  we  substitute  (7)  in  (1 1)  and  make  use  of  (6)  we  have 


r b / Xj 

Dj  = 

X/  + y 


(13) 


In  the  double  filtering  (the  physical  filtering)  described  by  (5),  followed  by  the  inversion  mathematical  filtering 
described  by  (1 1),  the  original  vector  coefficient  b/  is  reduced  by  the  factor  (1  + y l\i)~l . These  methods  are 
illustrated  in  table  2,  which  lists  results  according  to  Eq.  (9),  with  the  final  solution  taken  as  the  sum  of  the  first 
four  vector  contributions,  and  in  table  3,  which  lists  results  for  Eq.  (1 1)  with  y = 0.5.  These  tables  also  show  the 
progressive  reduction  of  the  variance  Vg  of  the  observations  as  each  vector  contribution  is  added  to  the  solution. 

In  summary,  the  inference  of  atmospheric  profiles  from  radiance  observations  is  not  so  much  a problem  of 
solving  n linear  equations  for  m unknowns  as  it  is  a problem  of  establishing  the  degree  of  filtering  that  assures 
proper  transformation  of  the  information  content  of  the  observations  into  profile  information,  and  filtering  out 
and  discarding  of  the  spurious  contributions  from  the  random  noise  of  measurement.  The  relative  magnitude  of 
m with  respect  to  n is  of  no  consequence  in  the  presence  of  adequate  filtering,  except  from  the  point  of  view  of 
accuracy  in  (5)  and  computational  economy.  Thus  (10)  should  be  used  if  n>m  or  Eq.  (12)  if  n<m;  either  of 
these  is  preferable  to  Eq.  (1 1)  or  truncated  Eq.  (9)  because  matrix  inversion  is  generally  computationally  faster  than 
calculation  of  eigenvalues  and  vectors.  Filtering  may  be  accomplished  by  the  relatively  direct  methods  described 
above  or  by  indirect  methods  such  as  those  described  by  Chahine  [1968] , Conrath  [1969] , and  Smith  [1970] . 


1-8 


DIFFUSE  TRANSMISSION:  GROUND-BASED  MEASUREMENTS 


The  classical  example  of  diffuse  transmission  is  the  so-called  “Umkehr  effect,”  which  was  first  observed  by 
Gotz  [ 1931  ] and  was  used  by  Gotz  et  al.  [1934]  to  infer  the  main  features  of  the  ozone  profile.  It  was  known  at 
that  time  that  most  of  the  atmospheric  ozone  must  be  present  in  a layer  somewhere  above  the  earth’s  surface  be- 
cause the  chemically  measured  ozone  concentrations  near  the  surface  were  far  too  low,  if  the  ozone  were  mixed 
uniformly  through  the  entire  atmosphere,  to  account  for  the  total  ozone  amounts  determined  optically  from  the 
measured  attenuation  of  the  direct  solar  beam.  However,  the  precise  height  or  center  of  gravity  of  the  ozone  layer 
was  uncertain. 


The  Umkehr  effect  is  observed  when  measurements  < re  made  of  the  zenith  sky-light  radiances  at  two  wave- 
lengths in  the  solar  ultraviolet  with  the  sun  near  the  horizon.  If  the  ratio  of  the  radiance  of  the  shorter  (more 
strongly  absorbed)  wavelength  to  the  radiance  of  the  longer  (weakly  absorbed)  wavelength  is  plotted  against  the 
sun’s  zenith  angle,  this  radiance  ratio  decreases  as  zenith  angle  increases  until  a minimum  is  reached  for  a zenith 
angle  of  about  85°  (when  the  wavelengths  are  3114  and  3324A).  As  the  zenith  angle  increases  further,  the  latio 
increases  again.  The  effect  is  illustrated  in  figure  4,  where  (he  ordinate  is  -100  times  the  common  logarithm  of  the 
radiance  ratio.  Gotz  reasoned  that  the  angular  position  of  the  reversal  was  related  to  the  height  of  the  ozone  layer 
and  that  it  occurred  when  the  effective  scattering  layer  for  the  shorter  wavelength  had  risen  above  the  ozone  layer 
(see  fig.  3). 


In  obtaining  the  first  indirect  profiles,  Gotz  et 
al.  [1934]  recognized  many  of  the  difficulties  that  have 
been  “rediscovered”  in  more  recent  years-namely,  the 
strong  interdependence  between  the  points  on  the 
Umkehr  curve  and  the  fact  that  the  shape  of  the  curve 
depends  mainly  on  the  total  amount  of  ozone.  By 
dividing  the  atmosphere  into  rather  thick  (IS  km) 
layers  and  assuming  constant  ozone  distribution  in  each 
layer,  Gotz  et  al.  were  able  to  obtain  a block  (histogram) 
solution  through  which  they  later  drew  a smooth  curve 
so  that  the  amount  of  ozone  in  each  layer  was  the  same 
as  in  the  block  distribution,  but  ozone  density  varied 
smoothly  with  height.  They  correctly  inferred  that  the 
ozone  center  of  mass  was  just  above  20  km  for  average 
midlatitude  total  ozone  amounts,  and  that  the  main 
ozone  changes  accompanying  changes  in  total  ozone 
occurred  in  the  lower  stratosphere  so  that  the  center 
of  gravity  became  lower.  Their  main  result  was  verified 
by  direct  measurement  by  Regener  and  Regener  [1934] , 
who  sent  a spectrogr?ph  aloft  by  balloon  to  32  km. 

Because  only  the  radiance  ratio,  and  not  the  indi- 
vidual radiances,  is  available  for  Umkehr  observations, 
the  equation  usually  used  in  recent  years  is  [Mateer  and 
Dutsch,  1964] 

m 

V bxj=N{0)-U(e)-S(6)  (14) 

“ dXi 


ZENITH  ANGLE  (DEGREES) 


Figure  4.-  Umkehr  curve  for  observations  at  Etfmonton,  Canada, 
on  the  afternoon  of  May  22, 1961  \Mateer,  1964) . 
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where  6 is  the  sun’s  zenith  angle  and 


Nm  = 100  [ log  U(8)IF0)3SM  - log(/(0)/Fo),ll4)  (15) 

The  quantity  N may  be  thought  of  as  an  observed  relative  logarithmic  attenuation  (having  units  of  centibels),  V is 
the  relative  loganthmic  attenuation  for  some  “standard”  ozone  distribution,  x,-  is  the  amount  of  ozone  in  the  ith 
layer  of  the  numerical  model  of  the  atmosphere,  and  S(0)  represents  nonlinear  effects  ignored  in  the  first-order 
Taylor  expansion.  Customarily,  measurements  are  taken  while  the  sun’s  zenith  angle  varies  between  60°  and  90°. 
The  equivalence  of  Eq.  (14)  to  Eq.  (5)  is  clear:  A is  the  matrix  of  first-order  partial  derivatives,  the  Ax,-  are  the 
elements  of  Af,  and  N(6)- U(d)~  S(6)  comprise  the  elements  of  Ag.  The  equation  is  solved  iteratively, 

m 

I *rv,-sr  (16) 

»=i  1 


where  the  superscript  v refers  to  the  iteration,  the  subscript 


1C1U9 


p—  , 


H 


l 


a iuj 

dxjdxk 


(Ax, /(Ax;)" 


(17) 


The  iterations  are  continued  until 

Sh'-sr  i <6  "s> 

/ 


where  6 is  some  suitably  small  number.  Further  details  of  this  method  may  be  found  in  Matter  and  Dutsch  [ 1 964] . 

When  solutions  of  this  system  are  obtained  by  the  filtering  methods  of  Twomey  described  earlier,  they  are  found 
to  e:., libit  large-scale  differences  from  concurrent  (nearly  simultaneous)  direct  balloon  sounding  results  [Matter  and 
Dutsch,  1964;Bo/Jtov,  1966;  Craig  etaL,  1967] . In  particular,  these  Umkehr  results  show,  on  the  average,  too  little 
ozone  near  the  main  maximum  and  too  much  ozone  at  lower  levels  in  the  atmosphere. 

Since  the  analysis  of  the  scale  transmission  properties  of  the  inversion  procedure  indicates  that  the  large-scale 
features  of  the  profile  should  come  through  with  good  fidelity,  one  suspects  that  the  errors  are  introduced  by  in- 
adequacies in  either  he  physical  model  cf  the  atmosphere  or  the  treatment  of  nonlincarities,  or  both.  DeLuisi  [1969] 
has  shown  that  a bettor  treatment  of  the  first  of  these  produces  only  partial  improvement.  Consequently,  one  con- 
cludes that  the  nonlinearities  must  be  important  and,  in  particular,  that  a poor  choice  for  the  standard  distributions 
from  which  the  solutions  are  computed  may  be  responsible. 

The  optimum  statistical  inversion  method  [Rodgers,  1966 \ Strand  and  Westwater,  1968]  has  been  applied  to  the 
evaluation  of  Umkehr  observations  by  DeLuisi  and  Mateer  [1971].  The  mathematical  basis  for  this  method  is  described 
ir.  detail  by  C.  D.  Rodgers  in  the  next  section.  DeLuisi  and  Mateer  used  Eq.  (16)  in  the  form 

LT±;^-',rv,  <“> 

/“i 
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TABLE  4.-  VARIANCE  OF  BOULDER  OZONE  DISTRI- 
BUTIONS AFTER  APPLICATION  OF  THE  TOTAL 
OZONE  REGRESSION.  VARIANCE  UNITS  ARE 
(Ioge)2  [DeLuisi  and  Matter.  19711 


where  In  = 10^.  They  ignored  nonlinear  effects  and 
did  not  perform  iterations  on  the  premise  that  a good 
statistical  first  guess  was  used  as  the  standard  distribu- 
tion. Here  the  optimum  solution  profile  is  given  by 


Layer 

L i , er  boundaries, 
mb 

Variance 

1 

1000-250 

0.0814 

2 

250-177 

0.3763 

3 

177-125 

0.2529 

4 

125-88.4 

0.1332 

*> 

88.4-62.5 

0.0430 

f. 

62.5-44.2 

0.0194 

v 

44.2-31.2 

0.0109 

S 

31.2-22.1 

0.0059 

9 

22.1-15.6 

0.0107 

13 

15.6-11.0 

0.0205 

li 

11.0-7.81 

0.0263 

12 

7.8I-S.52 

0.0260 

13 

5.52-3.91 

0.0185 

14 

3.91-2.76 

0.0100 

IS 

2.76-1.95 

0.0041 

16’ 

1.95-1.38 

0.0008 

1 Ozone  amount  above  the  top  of  layer  16  is  1.7 
times  t’  e ozone  content  of  layer  16. 


ozone  in  the  atmosphere,  some  assumption  had  to  be 


Af  = SfA*  (ASfA*  + Ser‘  Ag  (20) 

where  Sj-  is  the  covariance  matrix  of  the 
ozone  profile  (the  elements  are  in  effect 

l L 

(Sf)j / = L A ln  *ik  ~ Alnx/k  • where 

subscript  k refers  to  the  k th  profile  in  the  sample 
and  AlnXyj.  is  the  logarithmic  deviation  m the  i th 

layer  between  the  true  profile  and  the  first  guess  pro- 
file) and  Sf  is  the  covariance  matrix  for  the  random 
errors  present  in  Ag  . 

The  first-guess  ozone  profile  was  obtained  by  a 
log-linear  regression,  using  total  ozone  as  predictor, 
and  was  based  on  the  500  odd  direct  balloon  sound- 
ings at  Boulder  (fig.  1).  Since  the  direct  balloon 
soundings  normally  terminate  at  about  30  km.  but 
the  radiance  measurements  are  influenced  by  all  the 
about  the  ozone  profile  above  30  km.  This  was  done  by 


smoothly  mergi  g each  observed  profile  onto  Dutch’s  [1959]  standard  profile  in  such  a way  that  all  profiles  in  the 
statistical  sampl  were  identical  above  about  45  km.  As  indicated  above,  the  matrix  Sy  was  calculated  from  the 
deviations  of  eatn  individual  profile  from  the  corresponding  first-guess  profile.  The  diagonal  elements  of  Sy  are 
listed  in  table  4 along  with  the  presc'  .e  boundaries  for  each  layer  in  the  evaluation  model. 


TABLE  5 - AVERAGE  AND  VARIANCE  OF  EMPIRICAL  ADJUSTMENT  TO  OBSERVED 
UMKEHR  CURVES  AT  VARIOUS  LOCATIONS  [DeLuisi  and  Mateer,  19711 


Solar  zenith 

angles, 

oeg 

Arosa 

Average, 

cb 

Var'inf'e, 

(c*)3 

Average 

cb 

Aspendale 
, Variance, 

( cb y 

Tallahassee 

Average,  Variance. 

cb  (cb)1 

65 

-0.10 

0.27 

0) 

(>) 

0.10 

0.18 

70 

-0 

1.00 

0) 

0) 

0.07 

0.58 

74 

-0.45 

1.95 

-0.23 

0.44 

-0.07 

1.12 

77 

-0.78 

3.47 

-0.34 

0.76 

-0.27 

1.77 

80 

-0.91 

4.94 

-0.31 

1.68 

0.10 

2.44 

83 

-0.22 

8.12 

1.14 

4.00 

1.00 

4.73 

85 

0.96 

7.83 

2.97 

6.48 

2.63 

5.87 

86 

1.69 

8.01 

4.41 

8.47 

3.96 

6.02 

SO 

2.27 

8.80 

5.48 

12.04 

4.63 

5.22 

89 

1.93 

10.17 

5.61 

13.58 

4.18 

4.11 

90 

.8) 

11.06 

5.06 

16.15 

3-23 

3.22 

1 The  smallest  zevm  c,igle  available  for  all  umkehis  at  Aspendale  is  70s . The  N -value  at  70°  is  subtracted  from  the 
A-values  at  the  .emainhtg  zenith  angles  to  remove  the  instrumental  constant. 


DeLuisi  and  Matecr  next  estimated  Sf  from  samples  of  nearly  simultaneous  Umkehr  measurements  and 
directly  measured  ozone  profiles  at  Arosa  (Switzerland),  Aipendale  (Australia),  and  Tallahassee  (Florida).  They  did 
this  by  using  each  profile  and  (19)  to  compute  the  Nj  values  that  “should”  be  observed  for  each  profile.  The  dif- 
ferences between  the  observed  Nj  and  these  computed  Nj  were  considered  representative  of  the  effects  of  errors 
of  measurement  not  only  of  the  radiance  ratios  but  also  of  the  ozone  profiles,  and  to  include  errors  introduced  by 
inadequacies  of  the  physical  model  of  the  atmosphere  and  the  nonlinearities  ignored  in  (19).  The  average  differences 
Nj  - Nj  are  listed  in  table  5 along  w'  ‘h  the  diagonal  elements  of  S*  . Solutions  were  obtained  from  (20)  after 
first  adjusting  the  actual  observed  Umkehr  data  according  to  the  average  Nj  - Nj  at  each  location. 

Results  are  presented  in  tables  6,  7,  and  8 for  Arosa,  Aspendale,  and  Tallahassee,  respectively.  In  each  table 
the  second  column  lists  the  average  ozone  profile  in  tenns  of  the  mean  ozone  partial  pressure  in  micromillibars 
(#imb)  in  each  layer.  The  third  and  fifth  columns  list  the  average  first-guess  and  solution  profiles,  respectively.  The 
fourth  and  sixth  columns  list  rms  errors  for  these  profiles.  At  the  bottom  of  each  average  column  is  the  average  total 
ozone,  while  at  the  bottom  of  each  rms  column  is  the  overall  rms  error,  for  layers  1 through  1 1 , in  terms  of 
AlnJ?/ , in  natural  log  units.  The  overall  rms  error  is  a measure  of  the  overall  fractional  error  in  the  layers  for  which  the 
balloon  sounding  data  were  available. 

The  tables  indicate  that  the  optimum  statistical  method  achieves  only  modest  improvement  over  the  first-guess 
profile  obtained  from  the  total  ozone  regression.  Since  this  modest  improvement  was  attained  through  the  use  of  an 
average  empirical  adjustment  vector  derived  from  the  dependent  sample,  the  utility  of  Umkehr  observations  for 
deriving  additional  information  about  the  ozone  profile  below  30  km  is  seriously  in  doubt.  (When  the  adjustment 
vector  was  not  used,  there  was  a slight  improvement  over  the  regression  profile  at  Arosa  where  the  overall  rms  error 
was  reduced  from  0.298  to  0.280  and  at  Aspendale  where  the  reduction  from  0.2S6  to  0.220,  but  at  Tallahassee 
there  was  a trivial  increase  from  0.305  to  0.306.)  The  extent  to  which  the  residual  error  in  the  ozone  profile  is  fine 


TABLE  6.-  VERTICAL  OZONE  DISTRIBUTION  AT  AROSA  IN  TERMS  OF  LAYER-MEAN  OZONE 
PARTIAL  PRESSURES.  SAMPLE  SIZE  » 49  [DtLuisi  and  Matter.  1971  ] 


Layer 

Measured 

average, 

limb 

Total  ozone 
rcgrearion 
Avenge,  RMS, 

limb  limb 

Solution 

Avenge,  RMS, 

limb  iimb 

1 

25.5 

23.8 

7.0 

25.9 

6.6 

2 

41.8 

3Z1 

20.3 

36.7 

17.6 

3 

59.9 

44.1 

23.9 

45.5 

23.8 

4 

74.1 

60.1 

21.2 

61.5 

18.9 

5 

108.4 

101.3 

17.9 

100.4 

14.8 

6 

139.8 

146.3 

16.6 

143.G 

13.3 

7 

151.6 

163.7 

16.8 

160.4 

14.3 

8 

137.7 

150.1 

15.5 

146.4 

12.9 

9 

114.8 

124.1 

12.9 

119.7 

8.8 

10 

89.2 

94.8 

11.0 

91.3 

7.3 

11 

65.8 

68.7 

7.9 

66.8 

5.4 

12 

46.8 

47.6 

6.0 

46.7 

4.6 

13 

32.1 

31.8 

4.2 

31.4 

3.7 

14 

21.1 

21.0 

2.0 

20.8 

1.8 

15 

13.6 

13.6 

0.8 

13.5 

0.7 

16 

8.7 

8.6 

0.2 

8.6 

0.2 

Total  otonr 

(atm-cm)  .329 

Oenh  mu  error, 
layers  1-11 
0o*e) 

.329 

.298 

.327 

.255 

1-12 


TABLE  7.-  VERTICAL  020NE  DISTRIBUTION  AT  ASPENDALE  IN  TERMS  OF  LAYER-MEAN 
OZONE  PARTIAL  PRESSURES.  SAMPLE  SIZE  = 33  [ DeLuisi  and  Mateer.  1971) 


Layer 

Measured 
Average 
0 imb) 

Total  ozone 
regression 
Average,  RMS, 

(umb)  (umb) 

Solution 

Average,  RMS, 

(umb)  ( umb ) 

1 

22.9 

22.2 

3.9 

23.2 

3.8 

2 

31.4 

23.2 

15.7 

25.2 

11.4 

3 

37.0 

32.8 

16.0 

33.5 

13.7 

4 

48.2 

49.7 

13.7 

50.5 

12.2 

5 

92.5 

92.3 

14.0 

93.5 

15.7 

6 

138.2 

137.5 

13.3 

138.1 

11.9 

7 

1S7.0 

157.4 

11.8 

157.5 

9.6 

8 

149.5 

148.3 

12.2 

147.1 

11.6 

9 

126.1 

124.2 

14.3 

121.1 

12.7 

10 

92.4 

95.4 

12.0 

91.8 

7.9 

11 

63.3 

69.0 

10.4 

66.1 

T5 

12 

43.3 

47.7 

7.5 

45.8 

6.0 

13 

29.4 

31.9 

4.1 

30.8 

3.3 

14 

19  8 

21.0 

2.0 

20.5 

1.6 

IS 

13.1 

13.6 

0.8 

13.4 

0.6 

16 

8.5 

8.6 

0.7 

8.6 

0.2 

Total  ozone 


(atm-cm)  .316  .316  - .315 

Overall  rms  error, 
layers  1-1 1 

(log*)  - - .256  - .205 

TABLE  8.-  VERTICAL  OZONE  DISTRIBUTION  AT  TALLAHASSEE  IN  TERMS  OF  LAYER-MEAN 
OZONE  PARTIAL  PRESSURES.  SAMPLE  SIZE  = 39  [ DeLuisi  and  Mateer.  19711 

Layer 

Measured 

average, 

(umb) 

Total  ozone 
regression 
Average,  RMS, 

(u  mb)  (umb) 

Solution 

Average,  RMS, 

(umb)  (umb) 

1 

19.8 

21.4 

8.2 

21.1 

6.7 

2 

19.7 

19.0 

10.4 

17.4 

10.2 

3 

24.7 

27.4 

11.8 

25.4 

9.7 

4 

40.4 

44.4 

14.0 

42.9 

11.7 

5 

80.5 

87.3 

15.5 

87.7 

13.7 

6 

135.5 

132.6 

15.6 

134.3 

12.3 

7 

164.8 

153.7 

20.7 

155.6 

18.4 

8 

157.0 

147.2 

14.2 

148.6 

13.0 

9 

125.7 

124.3 

12.0 

124.9 

9.2 

10 

92.7 

95.7 

14.6 

95.5 

11.9 

11 

67.3 

69.2 

9.5 

68.6 

7.5 

12 

46.9 

47.8 

5.4 

47.4 

4.5 

13 

31.4 

31.9 

2.8 

31.7 

2.4 

14 

20.8 

21.0 

1.3 

20.9 

1.1 

15 

13.6 

13.6 

0.5 

13.6 

0.5 

16 

8.6 

8.6 

0.1 

8.6 

0.1 

Total  ozone 

(atm-cm) 

.308 

.308 

- 

.307 

— 

Overall  rms  error. 

layers  1-11 

floge) 

- 

- 

.305 

- 

.269 

M3 


structure  and  not  accessible  to  reduction  by  inversion  of  Umkehr  measurements  has  not  been  investigated.  The 
results  of  DeLuisi  and  Mateer  [1971  ] suggest  that  it  would  be  more  advantageous  to  invest  effort  in  improving  the 
regression  estimation  method  rather  than  in  further  work  on  the  low-level  Umkehr  invention.  Similar  conclusions 
have  been  reached  by  Sellers  and  Yarger  [1969]  and  Yarger  [1970]  for  the  case  of  satellite  observations.  Insofar  as 
the  high-level  ozone  profile  is  concerned,  no  direct  comparisons  have  been  made  between  Umkehr  results  and  con- 
current rocket  data.  However,  although  the  high-level  ozone  profiles  derived  from  Umkehr  observations  are  particu- 
larly sensitive  to  measurement  bias  [Mateer,  1965]  and  apparently  haze  effects  can  lead  to  erroneous  conclusions 
about  seasonal  variation  at  high  levels  [DeLuisi  and  Furukawa,  1970] , the  high-level  Umkehr  results  are  gencially 
consistent  with  good  rocket  results.  The  high-level  profile  remains  the  one  area  of  potential  utility  of  carefully 
made  Umkehr  observations. 


DIFFUSE  REFLECTION 


Satellite  Measurements 


The  inference  of  the  ozone  profile  from  satellite  measurements  of  the  solar  ultraviolet  radiation  backscattered 
by  the  earth  and  its  atmosphere  is  conveniently  divided  into  two  subproblems:  the  inference  of  the  high-level  pro- 
file above  25  - 30  km  and  the  inference  of  the  low-level  profile  below  this  level.  The  primary  reason  for  this  distinc- 
tion is  economic  and  lies  in  the  complexity  of  the  light-scattering  calculations;  for  the  shorter  wavelengths  that 
“scan”  the  atmosphere  above  the  ozone  maximum  and  do  not  penetrate  through  the  ozone  layer,  a single-scattering 
physical  model  is  quite  adequate  to  calculate  backscattered  intensity  to  well  within  In  figure  2,  such  wavelengt!  s 
are  2975A  and  shorter.  For  wavelengths  that  penetrate  the  ozone  layer  and  are  backscattered  appreciably  within 
the  troposphere,  multiple  scattering  calculations  are  essential  and  the  effects  of  aerosol  scattering  as  well  as  cloud 
and  ground  reflections  become  quite  important.  In  addition,  a considerable  body  of  a priori  statistical  information 
about  the  low-level  ozone  profile  is  available,  whereas  relatively  few  reliable  data  are  available  for  the  high-level 
profile.  Because  of  these  difficulties,  the  high-level  inversion  has  received  considerable  attention  over  the  past 
decade,  while  few  pioneering  spirits  have  ventured  into  the  low-level  inversion  problem. 

The  possibility  of  deducing  the  ozone  profile  from  backscattering  measurements  was  first  suggested  by  Singer 
and  Wentworth  [1957] . The  first  mathematical  examination  of  the  problem  was  by  Twomey  [1961] , who  showed 
that  for  a single-scattering  atmospheric  model,  the  spectral  energy  distribution  of  the  backscattered  radiance  was  a 
Laplace  transform  of  the  ozone  profile,  when  atmospheric  pressure  *^os  expressed  as  an  explicit  function  of  the  mass 
of  ozone  above  a given  pressure  level.  This  method  has  been  used  in  the  USSR  to  evaluate  measurements  of  back- 
scattering  from  the  Soviet  satellites  [Kmnopol’skiy,  1966;  Ioienas,  1 968 ; /arenas  etal.  1969a,  b]. 

" ie  back  ottered  radiance  / in  the  satellite  nadir  direction,  for  solar  zenith  angle  $ and  wavelength  X in  a 
plane  parallel  at.-  aspheric  model,  is  given  by 

'(M)  = Fo(*)(30X/16ir) (1+008*0)  f exp  j -(1  + sec  0X<*X*p  + 0Xp)  ] d?  (21) 


o 


where 

F0(X)  = 

extraterrestrial  solar  irradiance 

h = 

atmospheric  scattering  coefficient  (atm  *' ) 

n\ 

ozone  absorption  coefficient  (atm-cm)_1 

II 

amount  of  ozone  above  pressure  p (atm)  in  atm-cm 

1-14 


If  we  define 


St&L. 


2ir. 

— i— . ; » 


G(X,<>) 


7(X,e)16n 

F0(\)  ■ 3/3x  • (1  + cos1  6) 


(22) 


assume  0^p  « a ^Xp  , and  set  it  = a^(  1 + sec  0 ) , then 

i 

G(* ) =y* e~kxP  dp  (23) 

0 

This  equation  has  the  form  of  (3),  wherein  the  profile  appears  only  in  the  kernel  and  not  separately  in  the  integrand. 
Twomey  [1961]  changes(23)  to  the  form(l)  by  letting  X become  the  variable  of  integration,  thus 


Q(k ) = 


-kXdP 


dX  + p(o) 

dX 


(24) 


where  X0  is  the  total  atmospheric  ozone  content  and  dpi  dX  is  the  inverse  of  the  ozone  mixing  ratio.  For  the 
large  values  of  k associated  with  X<2975  A,  we  may  let  Xq"00  at  the  upper  limit  of  integration  (fig.  2).  The 
second  term  on  the  right-hand  side  of  (24)  represents  scattering  by  air  molecules  entirely  above  the  ozone  layer.  It 

then  follows  directly  that 


a in  1/Fo 
a In  > 


0 - 01  -02  - 03  - 04  - 05 


CONTRIBUTION 


Figure  5 - Contribution  to  the  nadir-direction  radiance  by  back- 
scattering  at  various  levels  in  the  atmosphere  for  satellite 
observations  with  solar  zenith  angle  70°  (solid  curves).  Dashed 
curves  are  partial  derivatives  (see  also  text). 


G(*)  = p(o)  + *I*(p)  (25) 


where  L ^(p)  is  the  Laplace  transform  of  p -that  is. 


k 

e~kxp(x) d*  . 


The  inversion  problem 


reduces  to  one  of  finding  a suitable  analytical  form 
for  Q(k^  and  looking  up  its  Laplace  transform-  It 
is  necessary,  of  course,  to  ensure  that  the  analytical 
form  for  Q(k)  effectively  filters  out  the  random 
noise  in  Q(k)  [Twomey  and  Howell,  1963]. 


Existing  direct  observational  data  suggest  that 
the  high-level  ozone  profile  is  approximately  expo- 
nential with  height  [Green,  1964 ; Rawcliffe  and 
Elliott,  1966] , which  in  an  exponential  atmosphere 
is  directly  equivalent  to 


Xp  = Cpva  (26) 

where  C and  o are  constants  that  specify  the  profile, 
a being  the  ratio  of  the  ozone  scale  height  to  the 
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atmospheric  <cale  height.  If  (26)  is  substituted  in  (23)  and  the  integration  performed  (or  solve  for  p,  substitute  in 
(25),  and  look  up  the  transform),  we  obtain 


2(*)  - (*C)-«r(l  + o)  (27) 

where  F(x)  is  the  gamma  function  with  argument  x.  According  to  (27),  plotting  log  Q against  log  k will  give  a 
straight  line  with  slope  ~o.  Thus  a good  first  approximation  to  the  high-Lvel  ozone  profile  may  be  obtained  by 
calculating  .he  linear  least-squares  fit  between  log  Q and  log  k. 

In  attempting  to  infer  additional  information  about  the  ozone  profile  from  backscattered  radiance  measure- 
ments, it  is  convenient  to  substitute  (22)  in  (2 1 ),  to  obtain 


C(X.fl)  - 


Because  Q(\,d)  changes  by  about  one  order  of  magnitude  between  the  shortest  and  longest  wavelength,  it  is 
further  convenient  to  scale  the  system  by  writing 


fm  j-(l  + sec  6).(a^Ap  + Pfj>)  j ip 


(28) 


N(\ , fl)=  In  [Q(X,0)] 


(29) 


aid  to  write  the  inversion  problem  as 


2 


dU^—  e)-  Ab<*/»Ar(X,fl)-l/(M)  = *(X,0) 

3 In  AT/ 


(30) 


which  is  similar  to  (19)  in  form.  Because  (30)  is  a linearization  of  ar  essentially  nonlinear  problem,  it  is  appropriate 
to  perform  the  inversion  iteratively,  as  follows 


i 

where  v refers  to  the  wth  iteration.  This  equation  is  of  the  same  form  as  (5);  by  Twomey’s  [1963]  method,  it 
would  have  the  solution  given  by  the  computationally  more  economical  operation  of  (10)  or  (12). 


The  physical  situation  in  (31)  is  illustrated  in  figure  5,  where  the  solid  curves  show  the  contribution  to  the 
radiance  in  the  satellite’s  nadir  direction  due  to  backscattering  at  various  levels  in  the  atmosphere  for  2555  and 
3019A.  The  dashed  curves  show  the  partial  derivatives  for  2555, 2876,  and  3019A,  while  the  short  horn  mtal  bars 
show  the  position  of  the  parial  derivative  maximum  at  intermediate  wavelengths.  For  this  case,  the  sohi  -nith  angle 
is  approximately  75®. 

In  this  section,  we  examine  the  properties  of  solutions  obtained  by  the  iterative  application  of  Twomey’s 
method  [Sm/rh,  1968;  Herman  and  Yarger,  1969]  and  by  the  iterative  method  described  by  Smith  [1970] . Two 
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essentially  equivalent  methods  arise  from  Smith’s  Procedure,  in  the  first  of  these,  a separate  solution  is  ob- 
tained for  each  wavelength: 


(4  In**)'*1  = 


(32) 


The  solutions  for  each  wavelength  are  combined  to  obtain  a weighted  average  solution  for  each  layer,  using  the 
partial  derivative  for  each  wavelength  and  layer  as  the  weight,  thus 


(Aln**)»'+1 


(33) 


The  abere  may  be  described  as  the  “sum  of  ratios.”  In  practice,  the  ratio  of  the  sums  gives  almost  precisely  the  same 
result  The  latter  form  is  obtained  from  (30)  or  (31)  bv  taking  Ain*;  outside  the  summation  and  letting  i*k, 
where  k refers  to  layers  for  which  the  partial  derivative  is  large.  Each  side  of  the  equation  is  then  postmultiplied 
by  bU(\,  d)IB\nXk , summed  over  X , and  the  result  is  solved  for  Ain**  , obtaining 


(Aln**)v+' 


X 


£(M)1P 
din**  J 


(34) 


These  iterative  methods  are  essentially  relaxation  methods  [Shaw,  19S3]  wherin  it  is  customary  to  use  an  “over- 
relaxation” factor  (F)  to  spe  ;d  the  convergence,  which  is  usually  extremely  slow  for  large-scale  forcing  functions 
(the  partial  derivatives  plotted  in  fig.  S).  In  the  present  context,  the  overrelaxaticu  factor  is  used  in  the  following 
manner 


xj*1  * xf  exp  [F(AlnJti)v+' ] (Alnx,)**1  <0 

*r  * */  |l  + F(A!iue/)F+*  ] (A!ux/)vH  >0 


(35) 


V+l 


The  first  form  prevents  xj ' * from  becoming  negative  when  (Alnx;) v+l  is  moderately  large  and 
the  second  form  prevents  xj*‘  from  becoming  too  large  when  (Alnx|)v+I  is  moderately  large 
Both  forms,  of  course,  give  tne  tame  result  for  small  (Alnx/)1’*1.  To  determine  the  best  value  of  F, 
ary  to  plot  the  rms  residual  o^  ■ jl/n  ^[Fv+I  (X,  fl)] 1 j 1/4  , against  F for  various  numbers 


negative,  while 
and  positive, 
it  is  custom- 
of  iterations 
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and  to  choose  the  value  of  F providing  the  most  rapid 
decrease  in  . Such  plots  are  illustrated  in  figure  6 
for  values  of  F between  I and  2.  Although  the  posi- 
tion of  the  minimum  changes  slightly  with  the  number 
of  iterations,  the  minimum  is  fairly  broad  and  one 
would  probably  select  F r-  1.85  as  the  best  value. 

So'utions  obtained  after  100  iterations  with 
F=  1.0,  for  three  different  first  guesses,  are  shown  in 
figure  7.  The  radiance  data  used  here  were  obtained 
from  the  backscattered  u’traviolet  (BUV)  experiment 
on  Nimbus  4.  The  iwo  straight-line  first  guesses  assume 
constant  ozone  mixing  ratios  throughout  the  atmo- 
sphere and  are  separated  by  a factor  of  10.  The  smooth 
solid  first-guess  curve  is  obt  lined  from  (27),  using  least 
squares  to  derive  a and  C in  (26),  which  is  then 
modified  to  form  (cf.  Green,  1964) 


Cpva 

Xp=  l + l P/Pj7^ 


(36) 


Figure  6.-  RMS  residual  (o^)  as  a function  of  overrelaxation 
factor  (F)  for  various  numbers  of  iterations  (S,  10, 15,  20. 
which  is  the  same  as  (26)  for  small  p , but  for  larger  25,  u,in*  iterative  method  of  Smith  (19701  • 

p exhibits  a maximum  ozone  partial  pressure  at 

psp*.  For  the  case  illustrate  1 C*  90.16,  o - 0.614,  and  p«  * 0.0398  atm,  for  p in  atm  and  Xp  m atm -cm. 

In  application,  p*  is  specified  from  prior  knowledge  of  the  low-level  ozone  distribution.  The  main  purpose  of 
figure  7 is  to  demonstrate  that  the  solution  is  essentially  independent  of  the  first  guess.  Solutions  obtained  from 
these  same  first  guesses  after  five  iterations  by  Twomey’s  method  with  y = 0.005  are  illustrated  in  figure  8. 
Although  the  differences  between  these  individual  solutions  are  somewhat  greater  than  those  shown  in  figure  7, 
they  are  fairly  small  between  10~*  atm  and  10-2  atm  (1  and  lOpmb)  and  the  profiles  are  in  good  agreement  with 
those  in  figure  7,  between  these  same  pressure  levels. 


Some  other  properties  of  these  iterative  solutions  are  illustrated  in  figures  9, 10,  and  1 1 . In  figure  9,  we  show 
the  Twomey  solutions,  after  one,  five,  and  thirty  iterations,  using  y = 0.005  and  the  first  guess  defined  from  (36). 
The  tendency  to  emphasize  the  finer  structure  as  the 
number  of  iterations  increases  it  quite  clear,  and  y 
probab'  / should  be  increased  quite  appreciably  after 
the  second  iteration.  What  happens  in  layer  44 
(pressure  1.4  pmb),  which  is  the  peak  of  one  of  the 
finer  structure  features,  is  shown  in  figure  10  for 
Smith’s  iterative  procedure  with  F ~ 1 .0  and  1 .85  and 
for  Twomey’s  method.  These  curves  should  be  exam- 
ined simultaneously  with  the  corresponding  curves  of 
figure  1 1 , which  show  the  decrease  in  rms  residual 
(o^)  with  increasing  iteration.  Two  important  con- 
clusions may  be  drawn  from  these  results.  First,  there 
is  no  tendency  for  the  solution  to  converge  in  the 
normal  mathematical  sense.  In  this  layer,  the  solution  ozone  partial  pressure  <»»>») 

continues  to  change  bv  a finite  amount  with  each  sub-  _ , , _.  . „ 

• „ , Figure  7.-  lnvenion  mom  proflUs  (thick  Bnes)  obtained  after  100 

sequent  iteration,  while  the  corresponding  decrease  m iteration,  with  F \ Corresponding  first  guesee*  are  »hown 

rms  residual  becomes  quite  small.  The  point  at  which  by  thin  line*.  Smith  s iterative  method  used. 
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we  cease  extracting  real  profile  information  and  begin 
introducing  the  noise  into  the  solution  profile  is  by  no 
means  clear.  The  second  conclusion  to  be  drawn  here  is 
that  the  most  economical  way  to  reach  this  point  is  by 
Twomey’s  method. 

The  question  of  when  to  terminate  the  iterative  pro- 
cedure must  be  geared  primarily  to  the  known  or  assumed 
standard  error  of  the  radiance  observations.  There  appears 
little  reason  to  proceed  beyond  iteration  v if 

( oR)v  aN  (37) 


or  if  l^(X,0)|<2oA,  for  all  X , whichever  comes 
first.  Here,  is  the  standard  error  of  N , the  quantity 
computed  from  the  observations  (see  (22)  and  (29)).  Since 
an  individual  value  of  N may  be  in  error  by  more  than  2a ^ 
criterion  such  as 


Figure  9.- Inversion  ozone  profiles  after  1 ;0 

iterations  with  Twomey's  method  using  .'05. 

First  guess  is  thin  continuous  line  shown  $ . • 8. 

in  a particular  case,  it  is  also  useful  to  have  an  additional 


I/?**'  (X,0)-Kv(X,0)l  < 6 


(38' 


for  all  X , where  6 is  chosen  from  experience  and 
should  be  sufficiently  small  that  criteria  (37)  cause  termi- 
nation of  the  iterative  procedure  most  of  the  time. 

The  accuracy  of  the  iterative  inversion  procedure  is 
illustrated  in  figure  1 2,  which  shows  a rocket  profile 
obtained  at  Pt.  Mugu,  California,  using  an  optical  rocket- 
sonde  [Krueger,  1969] , and  two  profiles  inferred  from 
Nimbus  4 BUV  data  [Krueger  et  al.,  1971  ] . In  this  case, 
the  inversion  has  been  performed  with  and  without  the 


ozone  p*nn*L  pressure  <k*.»i 


Figure  8.  - Inversion  ozone  profiles  (thick  lines)  obtained  after 
5 iterations  with  Twomey's  method  using  y *0.005.  Corres- 
ponding first  guesses  are  shown  by  thin  lines. 


Figure  10.  - Mean  partial  pressure  in  layer  44  after  various 
numbers  of  iterations. 
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2975  a radiance  observation.  The  sun  is  sufficiently 
high  in  the  sky  for  these  observations  that  at  2975A  , 
there  is  penetration  of  the  ozone  layer  and  some  multi- 
ple scattering  effect  (similar  to  that  exhibited  by 
3019A  in  fig.  2).  Consequently,  the  observed  radiance 
at  297 5 A is  too  high  to  be  consistent  with  the  single 
scattering  model  used  in  the  inversion  and  the  profile 
shows  too  little  ozone  near  and  just  above  the  main 
maximum.  However,  the  inversion  profile  obtained 
without  2975A  is  quite  good.  These  satel^te  profiles 
have  no  validity  below  the  ozone  maximum. 


The  Low-Level  Ozone  Profile 

The  problem  of  properly  including  multiple 
scattering  in  the  inversion  of  backscattered  radiance 
measurements  to  obtain  the  ozone  profile  below  30  km 
was  first  solved  by  Homan  and  Yarger  [19691  ■ They 
considered  a satellite  experiment  in  which  measure- 
ments of  polarization  components  were  taken  at  dif- 
ferent observing  angles  in  the  plane  containing  the 
satellite’s  nadir  dirr  .on  and  the  sun.  These  measure- 
ments were  taken  at  a single  wavelength.  Yarger 
[1970]  extended  this  study  by  considering  the 
measurements  taken  by  the  Nimbus  4 BUV  experi- 
ment in  which  radiance  observations  are  taken  at  several  wavelengths  in  the  satellite’s  nadir  direction.  The  paitial 
derivative  method  may  alto  be  used  here,  following  the  method  of  DeLuisi  and  Mateer  [1971  ] to  include  multiple 
scattering  in  the  calculation  of  the  partial  derivatives.  No  attempt  has  yet  been  made  to  obtain  the  low-level  ozone 
profile  by  the  inversion  of  real  measurements  and  the  present  discussion  is  restricted  to  results  with  synthetic 
measurements.  Because  ihe  Nimbus  4 BUV  radiance  measurements  comprise  the  only  available  real  data  suitable  to 
this  purpose,  the  present  discussion  is  restricted  to  Yarger’:  [1970]  results. 


In  the  case  of  multiple  scattering,  the  equivalent  of  (21)  for  the  radiance  of  backscattered  radiation  in  the 
nadir  direction  may  be  written  in  the  form 


/Mr), 


,r,.d.X] 


where 

t optical  depth  from  the  top  of  the  atmosphere 
( t * 0),  to  some  level  within  the  atmosphere, 
hence  defining  vertical  position  in  the  atmosphere. 

w(f)  albedo  of  single  scattering  at  level  t and  is  the 
ratio  of  scattering  optical  depth  to  total  (scatter- 
ing plus  absorption)  optical  depth  in  unit  volume 
and.  hence,  defines  the  ozone  profile 


y[cj(r),r,fl,X]e-rw(r)df  (39) 


Figure  12.  - Comparison  of  optical  rocketsondc  end  Nimbus  4 
(BUV)  ozone  profflet  over  Ft  Mugu,  Calif.imii. 

[Krueger  el  el,  1971). 
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Ti  total  optical  depth  of  the  entire  vertical  column  of  the  atmosphere 

J the  so-called  “source  function,”  which  is  computed  in  matrix  form  by  successive  iteration  of  the  auxiliary  equa- 

tion  of  radiative  transfer  [Dave,  1964]  and  then  converted  to  scalar  form  for  use  in  Eq.  (39);  note  that  J in- 
cludes the  extraterrestrial  irradiance,  wh-ch  is  shown  separately  in  Eq.  (2 1 ). 

This  equation  is  quite  general  for  the  nadir  viewing  direction  in  a plane-parallel  atmosphere  that  is  horizontally  homo- 
geneous and  in  which  only  absorption  and  molecular  scattering  occur. 

Equation  (39)  has  the  same  form  as  (2),  with  the  unknown  p.ofde  appearing  explicity  in  the  integrand.  However, 
co(f)  is  wavelength  dependent  and  is  not  suitable  as  an  unknown  for  direct  solution  when  we  use  multiple  wavelengths. 
We  have 


CO  A I •"  " “ “ “ 

1 axXi  + 0X  Ap,-  Atj 


(40) 


where  subscript  i 
the  ith  layer,  and 
J,(X)e-f/(x) 


refers  ic  the  /lh  layer,  x,  to  the  ozone  content  of  the  ith  layer,  A p,-  to  the  pressuie  change  across 
A tj  the  tota*  optical  depth  of  the  layer.  With  this  substitution  in  Eq.  (39)  and  letting  D,(X)  = 


m = 


D/(X)OxAp/ 


(41) 


In  this  form, (41)  is  equivalent  to  (4)  with  f = Ap,  AjX  = Z)y(X)0x,  and  g = I,  in  this  section  1 refers  to  the  vector  of 
radiance  observations,  not  the  identity  matiix.  Yarger  calls  this  the  pressure  increment  (PI)  method. 


In  a second  procedure,  which  Yarger  calls  the  ozone  increment  (OI)  method,  he  proceeds  on  the  nth  iteration, 
with  X-,  /V(X),  and  0*"  (X)  , to  obtain  Xj'*1  by  writing 


/(X) 


[A(X)  Api/Atj  ] v 0X  At 


(42) 


1%)  - 


[£>l-(X)Ap,-/Arf],'/3xAff 


(43) 


Subtracting,  he  then  obtains 


7(X)-/*'(X)  = 


[Dl.(X)AP//Ar,]vpxox(^+1-^) 


(44) 


which  is  equivalent  to  (*),  with  Ag  = 1 -Iv,  Af  = X|,+1-Xv,  and  A-x  = Ap,-/Ar,]  v 0X  . Forhis 

solutions,  Yarger  uses  Twomey’s  minimum  departure  from  an  initial  guess  in  the  form  shown  in  (11). 


Yarger’s  inversion  results  for  an  ozone  profile  observed  at  Boulder,  Colorado,  on  Feb.  20,  1964,  arc  illustrated 
in  figures  13, 14,  ard  IS.  Figure  13  shows  results  for  both  01  and  Pi  methods  when  his  synthetic  measurements  con- 
tain no  random  error  and  his  initial  guess  is  derived  from  a statistical . Jtod  using  total  ozone  (assumed  known)  as 
predictor.  Figure  14  shows  results  for  the  PI  method  when  his  synthetic  measurements  contain  5%  random  error  and 
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he  starts  from  a rather  poor  initial  guess.  Apart  from  a 
slightly  lower  partial  pressure  at  the  main  maximum 
in  figure  14,  the  solution  is  essentially  the  same  as  that 
shown  in  figure  13,  suggesting  th-t  the  result  is  inde- 
pendent of  the  first  guess.  Figure  IS  shows  Yarger’s 
results  for  the  PI  method  when  the  statistical  first  guess 
is  used  and  for  the  two  cases:  no  measurement  errors 
and  5%  measurement  errors.  Because  he  was  unable  to 
achieve  any  real  improvement  over  his  statistical  first 
guess,  Yarger  concluded,  “. . . there  seems  no  reason  to 
prefer  an  inversion  method  based  on  the  inversion  of 
the  radiative  transfer  equation  over  the  statistical 
method  for  inferring  information  concerning  the  ver- 
tical ozone  distribution.”  Since  statistical  inferences 
are  not  without  error  and  because  of  the  result  illu- 
strated in  figure  14  (poor  initial  guess),  it  is  difficult  to 
agree  completely  with  his  conclusion.  However,  Yarger's 
conclusion  may  be  correct  for  other  reasons-namely, 
the  difficulty  of  properly  accounting  for  surface  reflec- 
tivity, as  well  as  scattering  by  haze  and  clouds  in  the 
inversion  of  real  data  [Twomey,  1969] . Yarger 
recognized  these  additional  problems. 


Figure  IS.-  Inversion,  initial  guess  and  actual  ozone  profiles  for 
Boulder,  Colorado,  Feb.  20, 1964,  for  5%  measurement  errors 
[Yarger,  1971). 


CONCLUDING  REMARKS 


It  appears  that  the  application  of  mathematical  inversion  methods  to  infer  the  high-level  ozone  profile  from 
light  scattering  measurements  provides  useful  results.  However,  the  inference  of  the  low-level  ozone  profile  remains 
a venture  of  somewhat  dubious  value,  not  only  because  of  deficiencies  in  economical  computational  models,  but 
also  because  of  marginal  information  retrieval,  even  under  the  idealized  conditions  of  synthetic  experiments.  It  may 
prove  more  fruitful  to  measure  total  ozone  by  independent  techniques  and  to  use  statistical  methods,  with  total 
ozone  as  predictor,  to  infer  the  low-level  ozone  profile. 

From  the  mathematical  point  of  view,  in  the  absence  of  prior  statistical  information,  the  evidence  suggests  that 
the  method  of  Twomey  is  more  economical  than  the  iterative  method  for  inferring  high-level  ozone  profiles  from 
backscattered  radiance  measurements. 

It  is  quite  clear  that  the  mathematical  inversion  of  integral  equations  of  the  first  kind  should  not  be  considered 
as  a solution  of  a set  of  linear  alget  sic  equations,  but  rather  as  a filtering  proceuurc  in  which  the  real  information  in 
the  radiance  observations  is  transmitted  to  the  solution  profile  essentially  undamped,  but  the  spurious  fine  structure 
in  the  solution  profile,  which  arises  from  the  random  noise  in  the  observations  (and  in  the  mathematical  model),  is 
damped  out  entirely. 
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DISCUSSION 


Parker:  Whal  is  the  total  number  of  data  observations  entering  in  the  work  where  you  have  so  many  iterations  to 

solve  the  equations?  Clearly  not  the  six  you  mentioned  earlier. 

Mateer:  Five,  six,  or  seven,  depending  on  zenith  angle  of  sun. 

Parker  Why  do  you  need  an  iterative  technique  to  solve  a bunch  of  linear  equations  when  there  are  only  six  of 
them? 

Mateer:  Because  the  system  is  not  linear.  You  make  a first  guess  and  rerc”  >ute  residuals.  There  are  two  ways  to 

do  this  and  you  should  compare  them  to  see  if  the  system  is  linear. 

Bojarski:  What  is  an  uppe-  limit  of  N that  you  would  like  to  be  able  to  solve?  Obviously,  the  more  the  better. 

Mateer:  Not  necessar.ly.  As  shown  by  Twomey  the  measurements  are  linearly  dependent.  Suppose  that  you  have 

12  observations.  Given  say  the  best  4 of  those  12,  you  can  predict  the  other  8 within  the  error  measurement. 

I would  like  to  nave  about  6,  to  allow  some  redundancy. 

Bojarski:  How  about  the  number  of  data  points  in  the  final  result? 

Mateer:  In  the  range  70  to  1 SO  points.  The  matrix  to  be  inverted  is  still  the  size  of  the  number  of  measurements, 

6X6. 

Unidentified  speaker:  What  makes  the  solutions  nonunique?  You  said  there  were  an  infinite  number  of  solutions. 

Mateer:  The  variance  amplification  factor  is  responsible  in  the  complete  solution.  The  eigenvalues  are  so  small  for 

the  fine  scale  patterns.  You  must  be  careful  to  filter  just  the  right  amount. 

Chahine:  The  solution  of  an  integral  equation  with  fixed  limits  is  very  stable  and  is  unique  if  it  exists.  The  insta- 

bilities shown  here  are  due  to  the  fact  that  the  equations  were  linearized,  and  linearization  is  improper.  This 
violates  two  or  three  basic  mathematical  rules.  The  damping  functions  in  the  Twomey  method  and  other 
methods  are  means  of  rectifying  the  destruction  of  the  integral  equation  when  it  was  changed  from  a non- 
linear system  into  a linear  system  of  equations.  The  Method  of  Relaxation  is  one  means  of  solving  the  non- 
linear system  without  linearizing  it. 
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Statistical  retrieval  methods  for  remote  sounding  are  reviewed.  Methods  are  given  for  constraining  an 
essentially  incomplete  problem  by  means  of  the  known  statistical  behavior  of  the  solution.  Information  content 
of  the  observations  and  the  meteorological  structure  is  discussed.  Linear  versions  of  maximum  probability  and 
minimum  variance  methods  are  given  in  some  detail,  and  extensions  to  the  nonlinear  case  are  described. 


INTRODUCTION 


The  thermal  radiation  emitted  by  the  atmosphere  depends  on  the  distribution  of  temperature  and  absorbing 
substances,  including  water  vapor  and  clouds.  The  form  of  this  dependence  is  known,  so  that  in  principle  it  is  possible 
to  sound  the  meteorological  structure  of  the  atmosphere  from  remote  measurements  of  radiation,  either  from  the 
ground  or  from  a satellite. 

One  of  the  main  problems  in  interpreting  remote  soundings  of  ar.  atmosphere  is  that  there  are  not  enough  inde- 
pendent quantities  in  any  one  observation  to  determine  the  meteorological  structure  as  completely  as  one  would  wish. 
There  are  an  infinite  number  of  valid  solutions  that  agree  with  the  observation  within  experimental  error,  thus  giving 
us  the  problem  of  choosing  ihe  ‘best’  solution  in  some  sense. 

Statistical  methods  use  a priori  knowledge  of  the  statistical  behavior  of  the  atmosphere  as  a constraint  on  the 
solution.  Two  basic  approaches  have  been  used:  In  the  maximum  probability  method,  a probability  density  func- 
tion (pdf)  is  set  up  to  describe  the  atmosphere,  and  the  chosen  solution  is  the  one  most  likely  consistent  with  the 
observations;  the  minimum  variance  method  finds  a relation  that  obtains  the  meteorological  structure  from  the  ob- 
servati  ms  in  such  a way  Jut  the  deviation  of  the  derived  structure  from  the  original  structure  is  minimum  in  a 
statistical  sample. 

In  what  follows,  matrix  notation  is  normally  used,  although  suffices  are  used  where  necessary  to  clarify  the 
meaning  of  an  equation.  Occasionally  the  same  symbol  is  used  for  a vector  and  for  a matrix  comprising  a statistical 
sample  of  such  vectors.  The  distinction  should  be  clear  from  context. 


MODELS  FOR  THE  METEOROLOGICAL  STRUCTURE 


The  meteorological  structure  of  the  atmosphere  at  a particular  place  can  be  taken  to  consist  of  the  vertical 
temperature,  humidity,  cloud* 'and  possible  ozone  profiles.  These  are  all  continuous  functions  of  height  and  cannot 
be  represented  mathematically.  To  be  treated  at  all,  they  must  be  approximated  in  terms  of  a finite  number  of 
parameters.  The  simplest  treatment  is  a discretization  in  which  values  of  temperature,  humidity,  and  the  like  are 
specified  at  a number  of  fixed  levels  and  interpolated  between  these  levels.  This  is  s particular  case  of  a general  linear 
model  such  as 
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(1) 


0(z)  = S0(2 ) + 


bjSjiz) 


where  b t , i>2 , . . . , Pp  are  p parameters  representing  the  temperature  profile  6 (z)  as  a linear  combination  of  the 
functions  Sj(z) . These  functions  are  arbitrary,  except  for  the  restriction  that  they  be  independent. 

Most  representations  are  inefficient  in  the  sense  that  even  though  the  functions  Sj(z)  may  be  algebraically  in- 
dependent, the  coefficients  bj  are  not  statistically  independent.  This  is  especially  true  of  a discretization,  in  which 
the  temperature  at  any  one  level  is  strongly  correlated  with  the  temperature  at  nearby  levels.  It  is  clear  that  the  num- 
bers of  parameters  required  to  describe  a profile  to  a specified  accuracy  could  be  reduced  if  the  parameters  were 
statistically  independent.  It  is  also  clear  that  the  model  functions  Sj(z)  should  also  be  orthogonal,  and  for  conven- 
ience they  should  be  orthonormal.  Statistical  independence  implies  that  the  covariance  matrix  of  the  parameters  b 
should  be  diagonal: 

M 

j 2 N-V*  - o) 

m=l 


where  m indicates  membership  of  a sample  of  size  M,  and  A is  a diagonal  matrix. 

We  first  discretize  Eq.  ( 1)  so  that  we  can  use  the  algebra  of  matrices,  rather  than  the  algebra  of  functions: 

P 

&mz  = &Z*  bmj  SjZ 
/-I 

we  let  S0  be  the  mean  profile  6 . Orthonormality  of  S implies  that  S ^ = S_1  , so  we  can  put 

($-e)sT  = b 

and  on  substituting  this  in  Eq.  (2)  we  obtain 

^s(e-e)T(tf-d)sT  = a 

Aa 

so  that  the  functions  S are  eigenvectors  of  the  covariance  matrix  of  Q : 

h = ~(e-d)T(0-d) 

M 

It  can  be  shown  that  the  eigenvalue  A jj  is  the  amount  of  variance  “explained”  by  the  term  bmjSjz  >n  the  expan- 
sion of  $ , and  if  the  expansion  is  arranged  in  decreasing  order  if  A jj , it  can  be  truncated  at  any  term,  giving  the 
most  accurate  representation  for  that  number  of  terms  for  that  statistical  sample  in  a least-squares  sense. 

These  functions  are  known  as  empirical  orthogonal /unctions,  or  characteristic  patterns  [Lorentz,  1956; 
Obhukov,  1960;  Grimmer,  1963]. 
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TABLE  1-  EIGENVALUES  OF  SOME  MATRICES 


* 

H 

khkt 

K KT 

1 

2 

3 

4 

1 

0-5886. 

05886 

0.8941 

0.8941 

1.37 

2 

0.18S8 

0.7744 

0.0852 

0.9793 

0.44 

3 

0.1075 

0.8819 

0.0152 

0.9945 

0.32 

4 

0.0422 

0.9241 

0.0033 

0.9978 

0.185 

5 

0.0202 

0.9443 

0.0018 

0.9996 

0.090 

6 

0.0114 

0.9557 

0.0003 

0.9999 

0.038 

7 

0.0101 

0.9658 

0.0000 

1.0000 

0.014 

8 

0.0076 

0.9734 

0.0000 

1.0000 

0.0008 

T 

In  the  case  of  H and  K UK  the  eigenvalues  have  been  normalized  so  that 


£ fijj  = 1 . Thus  cols.  1 and  3 ate  the  fraction  of  variance  explained  by  the 
corresponding  eigenvector,  and  cols.  2 and  4 are  the  fraction  explained  if  the 
expansion  is  truncated. 


Table  1 illustrates  how  well  the  profile 
of  Planck  functions,  which  is  related  to 
temperature,  can  be  approximated  with  a 
small  number  of  parameters.  The  sample  is 
for  altitudes  of  0 to  70  km  for  the  northern 
hemisphere.  The  first  characteristic  pattern 
explains  nearly  60%  of  the  total  variance  in 
the  sample;  97%  of  the  variance  can  be 
explained  with  eight  patterns  and  therefore 
with  an  eight-parameter  model.  Thus,  fewer 
observations  than  expected  are  required  to 
determine  the  temperature  profile. 

The  analysis  in  this  section  has  been  in 
terms  of  temperature  only,  but  the  same 
method  can  be  applied  to  all  the  quantities 
making  up  the  meteorological  structure. 


THE  RADIATION  OBSERVATIONS 


The  radiance  / emitted  at  the  top  of  an  absorbing  atmosphere  can  be  written  as 


m = I (z))  (z ,°°)  dz  + B(v,e  (*))  — (*,oo)  (3) 

J dz  dz 

T? 

where  B(v,0)  is  the  Planck  function  at  wavelength  v and  temperature  0 , T(z ,°°)  is  the  transmission  of  the 
atmosphere  between  height  z and  a satellite,  and  g is  the  ground.  The  transmission  will  depend  on  the  distribu- 
tion of  absorbing  gases,  and  to  a small  extent  on  the  temperature  profile.  This  relation  is  essentially  an  integral  trans- 
form of  B,  the  Planck  function  profile,  with  kernel  dT/dz,  the  transmission  gradient. 

If  we  consider  a small  range  of  wavelengths,  and  a constant  distribution  of  absorber,  we  may  discretise  Eq.  ( 3) 
and  obtain  the  following  relation: 


//  = Kiz Bz  i = l, ...  n (4) 


where  //  is  the  radiation  at  frequency  vjt  Bz  is  B(v,0(z))  and  the  matrix  Kfo  is  related  to  the  transmission 
gradient.  This  simplified  form  is  useful  for  investigating  the  nature  of  the  problem,  although  more  detail  is  required 
in  practice.  Figure  1 shows  typical  set  of  K(z  . The  ordinate  is  z,  and  the  curves  are  labeled  with  i.  The  Planck 
function  is  a known  function  of  temperature  and  can  be  used  to  replace  temperature  in  the  analysis.  The  general 
linear  model  (1)  can  be  used  for  Bz  in  a discretized  form: 


Bz 


Stjbi 
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If  this  is  substituted  in  Eq  (4)  we  get 


P 

■'i  - Ti  * I A‘i»i 

M 

where  1 = KB  and  A = KS  , giving  a linear  relation  between  the  observations  and  the  model  parameters. 

The  components  of  the  observation  1 are  not  statistically  independent  partly  because  of  the  overlap  of  the 
kernels  (fig.  1),  and  partly  because  of  the  statistical  nature  of  the  atmospheric  Planck  function  profile.  Tc  get  an 
idea  of  how  much  information  there  is  in  a sample  of  observations,  we  can  transform  them  into  a statistically  inde- 
pendent form  using  the  same  method  as  used  for  the  vertical  profile  of  temperature.  The  covariance  ma::ix  of  the 
observations  is 


M 


^ 2 Vmi-WmrV  = -^K(BtB)(B-B)TKT  = KHK* 


m=l 


T 

where  H is  the  covariance  matrix  of  the  Planck  function  profile  B . The  set  of  eigenvalues  of  KHK1  given  in 
table  1 for  a typical  case  with  eight  kernels  shows  that  over  99%  of  the  variance  is  explained  by  the  first  three 
characteristic  patterns. 

The  interdependence  of  the  kernels  can  be  expressed  by  a similar  analysis.  If  we  seek  a transform  L so  that 


the  transformed  kernels  LK  are  orthogonal,  we  must  have 

LKKTL  = A 

when  A is  diagonal.  The  required  transform  is,  therefore, 
the  eigenvectors  of  KK^ , and  A g can  be  interpreted  as 
the  square  of  the  ratio  of  the  transformed  signal  to  the 
untransformed  signal  where  noise  is  constant.  The  number 
of  independent  observations  can  be  viewed  as  the  number 
of  eigenvalues  that  are  larger  than  (noise  variance) /(signal 
variance). 

THE  UNEAR  CASE 


The  linear  case  is  convenient  for  illustrating 
statistical  retrieval  techniques  without  the  algebraic 
complications  of  the  general  case.  The  problem  is  as 
follows. 

Given  a set  of  observations  I , which  are 
related  to  the  profile  parameters  b by 


I = Ab  + e (5) 


Figure  1.-  A typical  kernel  for  sounding  the  temperature 
profile,  using  the  CO,  ISn  band  [Rodgtrt,  1971 ) 
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where  A is  a given  matrix  and  e is  a random  variable  (experimental  error)  of  known  statistical  behavior,  and  given 
a sufficiently  large  statistical  sample  of  profiles  b , find  a statistically  optimum  inverse  relation  giving  b explicitly 
as  a function  of  I. 


Maximum  Probability 

The  maximum  probability  method  finds  the  profile  b that  maximizes  the  conditional  probability  density 
function  (pdf)  /’(bll)  -that  is,  the  most  likely  solution  consistent  with  the  observations.  The  expression 
/*(b  i ()db  denotes  the  probability  that  b lies  in  the  range  b,  b + db  when  I is  given.  Using  Bayes  theorem, 
we  write: 


P(llb)  = 


/»(!) 


(6) 


which  is  the  condition  pdf  of  the  observation  I given  the  atmospheric  state  b.  If  there  were  no  experimental  noise 
it  would  be  a delta  function  at  I * Ab , the  theoretical  value  of  the  observation.  The  presence  of  noise  turns  it  into 
a gaussian  distribution  centered  at  the  same  place: 


-2  logg  P(b  1 1)  = (I  - Ab)T  E'1  (1  - Ab)  + const 


where  E is  the  covariance  matrix  of  experimental  error,  which  is  usually  diagonal. 

We  estimate  P(b)  from  the  given  statistical  sample  of  b , and  if  we  choose  sufficiently  restricted  regions  of 
time  and  space,  a gaussian  distribution  can  be  used: 


-2  logg  P{ b)  = (b-b)TH~'(b-b)  + const 


where  b is  a mean  and  H is  the  covariance  of  b about  this  mean.  If  we  are  using  an  empirical  orthogonal  function 
representation  of  the  profile,  b will  be  zero  and  H will  be  diagonal. 

The  term  P(I)  can  be  ignored  in  Eq.  (S),  as  we  wish  to  maximize  F(I)  with  respect  to  b only.  This  is  done 
by  minimizing  -2  logg  P(b  II)  with  respect  to  b: 

~[(b-b)rH-‘(b-b)  + (I-Ab)TE"‘(I- Ab)|  = 0 


leading  to  the  solution 

b « (H~*  + aV  A)-1  ATE_,U  -T)  + b (7) 

The  most  likely  profile  is  thus  a linear  function  of  the  observations,  and  the  form  of  this  linear  function  can  be 
precomputed.  An  alternative  form  can  be  found  by  manipulation: 


Note  that  we  do  not  require  that  the  number  of  parameters  p in  the  model  for  the  meteorological  structure  b be 
less  than  or  equal  to  the  number  of  measurements  n comprising  an  observation  I . The  use  ot  statistics  has  intro- 
duced enough  extra  equations  into  the  system  to  prevent  the  problem  being  incomplete.  Of  the  two  forms  (7)  and 
(8)  of  the  solution,  (7)  will  be  used  if  p <n  , and  (8)  will  be  used  if  p > n , so  that  we  operate  on  the  smallest 
matrices.  If  Eq.  (8)  is  used,  there  is  little  need  to  reduce  the  number  of  model  parameters;  a simple  discretization 
can  be  used. 

Many  nonstatistical  inverse  methods  lead  to  relations  of  the  general  form  of  (7)  or  (3),  allowing  us  to  give  a 
statistical  interpretation  of  the  nonstatistical  constraints  used  [Rodgers,  1971]. 


Minimum  Variance 

In  the  minimum  variance  approach  we  choose  a linear  solution  b*  of  the  form 


b*  = D(I.-I)  + b 


in  such  a way  that  the  residual  variance 


m 

— S (b 

M ^ 


b*)J 


m=l 


is  minimized  for  the  statistical  sample.  This  is  straightforward  multiple  regression  of  b on  I . 


The  solution  is 


d = (b-bXi-T)T  [(l-Txi-i)1]"1 


(9) 


The  statistical  sample  can  be  obtained  by  collecting  coincidences  of  remote  soundings  and  direct  soundings  [Smith, 
1969] . This  procedure  has  tl  e advantage  that  the  kernels  need  not  be  known  accurately,  nor  is  absolute  calibration 
of  the  instrument  necessary.  Alternatively,  the  sample  can  consist  of  direct  soundings,  for  which  the  radiances  I 
are  computed  using  Eq.  (5).  If  we  substitute  (5)  in  (8)  we  get 

D = HAT(AHAT  + E)~' 


where  H and  E are  the  covariance  matrices  defined  in  the  preceding  section.  The  final  solution  b*  is,  therefore, 
identical  to  the  maximum  probability  solution 

b*  « HAT(AHAT  + Er'(I-I)  + b (10) 


This  equivalence  occurs  in  the  case  of  gaustian  statistics  and  a linear  solution. 

Note  that  although  D is  in  a sense  an  inverse  of  A , it  is  not  a generalized  inverse  in  the  sense  of  Moore 
[1935]  and  Penrose  [1955],  because  neither  DA  or  AD  is  a unit  matrix.  If  E = 0,  then  D is  a generalized 
inverse  of  A.  If  E*  0,  the  solution  found  from  ( 10)  does  not  give  the  same  radiances  as  measured,  but  givet 
radiances  within  experimental  error  of  those  measured. 
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The  maximum  probability  method  can  be  extended  to  include  any  kind  of  constraint  that  can  be  expressed  as 
a pdf.  As  described,  the  statistics  could  be  climatological,  covering  some  period  of  time  and  space.  However,  we 
could  make  use  of  a forecast  profile,  in  which  case  we  use  the  statistics  of  forecast  errors: 


-2  logg  P(b)  * (b  - bf)T  Hf‘  (b  - bf) 


where  bf  is  the  forecast  profile,  and  Hf  is  the  covariance  matrix  of  forecast  errors. 

Another  possibility  is  to  use  continuity  along  the  suborbital  track  in  the  case  of  satellite  measurements. 
Theoretically,  we  could  treat  all  the  observations  made  during  one  orbit,  say,  as  one  observation  and  retrieve  the 
meteorological  structure  as  a function  of  horizontal  distance  a well  as  height.  Unfortunately,  this  is  out  of  the 
question  because  of  size  of  the  matrices  involved.  However,  we  can  use  the  statistics  of  horizontal  temperature 
differences,  and  include  the  following  pdf  as  a constraint: 


-2  log,,  P(b)  * (b-bp)THA-'(b-bp) 


where  bp  is  the  previous  retrieval  and  H A is  the  covariance  matrix  of  differences  between  profiles  at  the  appro- 
priate distance  apart. 


THE  NONLINEAR  CASE 


The  notation  we  have  used  so  far  an  be  generalized  for  the  non’ inear  problem.  The  vector  b becomes  a set 
of  parameters  describing  the  meteorological  structure,  not  necessarily  linearly.  The  observation  I is  now  a known 
function  A(b)  of  the  structure  with  the  addition  of  experimental  ncise  e . 

The  most  straightforward  method  of  solution  is  to  linearize  *he  problem,  and  solve  it  with  one  of  the  linear 
methods  described  above,  iterating  until  convergence  is  obtained.  If  the  problem  initially  is  nearly  linear,  a careful 
linearization  may  eliminate  the  need  foriteration. 

A generalization  of  the  minimum  variance  can  be  stated  as  follows:  Given 

1 * A(b)  + e 


find  the  function  D(I)  ■ b*  such  that 


£ (b  - b*)1 


is  minimum  for  a given  sample  of  profiles.  This  variance  could  be  generalized  for  example  to 

(b-b*)TQ(b-b*) 

for  some  suitable  matrix  Q.  Stated  thus,  the  problem  is  difficult.  The  maximum  probability  method  is  more 
tractable. 


1-32 


We  can  write 


-2  log®  /»(Ji  >b)  » (!-A(b)^'£'  '(I-A(b))  + const 


and 

-2  logg  /»( b)  = (b  - b)r  H"1  (b  -7) 


The  solution  that  maximizes  P(b  11)  can  now  be  found  by  minimizing  the  explicit  function  of  b 

(I  -A (b))Tr’ (i -A  (b))  + (b  - b)TH"  (b  - b)  ( 1 1) 


using  a general  numerical  minimizing  algorithm  [e.g.,  Powell,  1964] . For  particular  forms  of  A(b)  it  may  be  possible 
to  construct  more  efficient  minimizing  routines,  or  even  to  solve  Eq.  (1 1)  algebraically.  If  the  solution  is  performed 
numerically  it  is  advantageou . to  reduce  as  far  as  possible  the  number  of  parameters  used  to  describe  the  structure. 
This  can  conveniently  be  done  by  using  empirical  orthogonal  functions. 

it  is  common  in  meteorological  remote  sounding  for  the  observations  to  be  linear  in  some  quantities  such  as 
Planck  function,  but  nonlinear  in  other  quantities  such  as  absorber  distributions.  It  is  possible  to  separate  the  linear 
and  nonlinear  parts  of  the  problem  in  terms  of  the  maximum  probability  method  [Rodgers,  1970] . We  can  put 


I = A (b)  + e = K(n)  • I + e 


where  8 is  the  linear  pc  rt  of  b and  n is  the  nonlinear  part.  The  kernel  K(n)  is  a function  of  absorber  distribu- 
tion. Using  Bayes  theorem  we  can  put 


/»(b 1 1)  - PO.nll)  « /*(!  Is, !)/*(! In) /*(n) //*(!) 


(12) 


where  the  pdf  are  of  the  forms 


-2  lofe/XI  In.I)  « (I  - *(n)  * >)T  * E_1  (n)  • (I  - K(n)  • I) 

-21oge/>(lln)  - (1-I(b))T*  IT1  (a) ’(!-!(■))  (13) 


where  the  covariance  of  1,  H,  and  mean  value  of  the  linear  part  T may  depend  on  n.  If  we  keep  n fixed  and 
solve  for  the  most  likely  I,  we  obtain  the  same  solution  as  before  (7): 

1*  -T(n)  - HKT(KHKT  + E)-‘  (I  -7(n))  ( 14) 


On  substituting  this  and  Eq.  (13)  back  in  (12)  we  obtain 
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-2 loje P(n,lll)  - [l-T(n)]T  • X(n)  • [i-T(n)]  -2ioge  P{ n)  + const 


(15) 


where 


X(u)  * (K(n)H(ii)KT(n)  + E)_l 


This  quantity  can  now  be  minimized  numerically  with  respect  to  the  nonlinear  parameters,  and  the  resulting  value  of 
n*  substituted  in  (14)  to  obtain  the  linear  parameters  1*.  The  matrix  X is  the  inverse  of  the  covariance  matrix  of 
I forgiven  n.  Thus  the  quantity  being  maximized  to  find  n*  is  essentially  /*(nll). 


COMMENTS 


It  is  not  possible  in  a review  such  as  this  to  illustrate  the  practical  use  of  all  the  retrieval  methods  discussed. 
The  accuracy  of  retrieval  depends  as  much  on  the  particular  radiances  measured  as  on  the  retrieval  method.  It 
depends  on  the  number,  shape,  and  location  of  spectral  intervals;  the  spatial  resolution  of  the  instrument;  experi- 
mental accuracy  ; the  precise  nature  of  the  atmospheric  model;  and  many  other  things.  Discussions  of  this  aspect  of 
the  subject  can  be  found  in  Matter  [1965] , Kodgen  [1970] , Strand  and  Westwater  [1968o,6],  Work  and  Flemi>.g 
[ 1966] , Conrath  [ 1968, 1969] , and  Turchin  el  d.  [1969] . 


DISCUSSION 


Chahine:  With  the  statistical  method,  clearly  one  can  get  most  proDaL,..  t . . nu  What  about  the  least  probam. 

profile? 

Rodgers:  That  is  a slightly  philosphical  matter.  Given  an  observation  you  want  to  find  some  sou.:  - »'ou 

have  an  infinite  number.  Which  one  do  you  want?  The  one  I want  is  the  one  most  likely  to  be  right. 

Chahine:  As  a physicist  I am  interested  in  the  unusual.  1 would  like  to  find  that  when  it  exists. 

Rodgeis:  If  a much  more  likely  profile  is  just  as  consistent  with  the  observations  this  is  surely  the  answer  you 

want. 

Falcone:  If  you  take  the  Bayesian  technique,  it  reduces  directly  to  Westwater-Strand’s  method,  because  you've 

used  gaussian  distributions.  If  you  don’t  and  you  don’t  maximize  with  respect  to  B,  you  can  develop  this 
system  into  a Kallman  filter  system,  which  yields  Smith’s  result,  except  that  your  iteration  scheme  gives  you 
a weighting  function  which  you  get  from  the  theory.  You  can  get  a mathematically  good  theory  as  Chahine 
wants,  by  looking  at  Tiekenoff  s method.  He  uses  compact  functions,  and  this  is  a good  theory.  Mathe- 
matically, it  has  its  basis  and  it  reduces  to  the  Westwater-Strand  method.  This  allows  you  to  play  the  non- 
linear game,  the  same  as  Staelin’s  m-thod. 

Rodgers:  I think  this  is  the  most  important  point.  You  want  to  choose  the  method  that  you  can  generalize  to  the 

nonlinear  case. 

Fleming:  The  standard  technique  for  solving  nonlinear  problems  is  to  take  the  nonlinear  equations  and  approxi- 

mate them  by  a sequence  of  linear  equations,  which  is  what  has  been  done  in  these  two  papers.  Why  does 
Chahine  insist  we  solve  directly  a nonlinear  problem? 

Chahine:  In  the  15m  case  that  Dr.  Rodgers  is  treating,  the  condition  factor  is  of  the  order  of  10s , which  means 

that  small  noise  or  error  in  measurements  will  be  amplified  by  this  factor,  making  the  solution  obtained  by 
this  linearization  process  inaccurate. 

Smith:  That  is  the  whole  point  of  the  error  covariance  matrix,  which  is  to  dampen  the  error. 

Chahine:  When  you  are  using  a damping  factor  you  should  look  at  how  much  information  you  are  getting  from 

your  measurement  and  how  much  information  you  are  getting  from  your  damping  function.  All  the  signifi- 
cant information  you  are  getting  is  coming  fiom  the  latter. 

Waters:  To  what  extent  does  the  accuracy  of  your  solution  depend  on  the  statistics  you  use-for  example,  locally 

and  one  season  versus  globally  and  all  seasons- and  is  there  an  optimum  way  of  breaking  the  tatter  down? 

Rodgers:  We  haven’t  been  able  to  do  that  as  yet.  There  is  not  an  optimum  way  of  breaking  this  down,  but  a lot  of 

research  is  going  into  it. 
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Mathematical  techniques  used  in  atmospheric  profile  inversion  are  revic  ..  limited  to 

methods  that  do  not  require  extensive  information  on  the  statistical  properties  of  the  profiles.  The  . 
the  calculation  of  vertical  profiles  from  noise  contaminated  infrared  spectral  measurements  is  examined.  Three 
specific  examples  of  methods  of  solution  art  given:  the  iterative  minimum  estimation  method,  a second  linear 
iterative  method,  and  the  algorithm  of  Chahine.  Satellite  data  are  used  in  an  empirical  comparison  of  the  methods. 
Several  factors  are  considered,  including  the  stability  of  solutions,  convergence  behavior  of  the  iterative  methods, 
and  dependence  of  the  solutions  on  the  first  guess,  a factor  that  is  particularly  important  in  the  treatment  of  data 
from  other  planets. 


INTRODUCTION 


The  first  explicit  suggestion  that  temperature  profiles  in  the  ter re« '.rial  atmosphere  could  be  obtained  from 
measurements  performed  from  an  earth  satellite  was  made  by  Kin?  [1956] . It  was  proposed  that  limb-scan  measure- 
ments of  thermally  emitted  atmospheric  radiation  be  used  for  this  purpose.  Kaplan  [1959]  pointed  out  that 
measurements  obtained  in  selected  intervals  within  an  infrared  absorption  band  with  a nadir-viewing  instrument 
could  be  used  for  the  same  purpose.  In  the  years  following,  a considerable  amount  of  theoretical  effort  was  devoted 
to  the  development  of  computational  techniques  for  obtaining  temperature  profiles  from  the  appropriate 
measurements. 

infrared  measurements  obtained  from  early  meteorological  satellites  covered  broad  spectral  bands  and  could 
not  be  used  to  obtain  vertical  profiles.  In  1964  the  first  measurements  suitable  for  temperature  profile  inversion 
were  obtained  from  a balloon-borne  spectrometer  [HUleary,  el  a/.,.  1965] ; this  work  was  followed  by  subsequent 
balloon  flights  of  other  instruments.  However,  it  was  not  until  the  launching  of  the  Nimbus  3 meteorological  satel- 
lite on  April  14, 1969,  that  suitable  measurements  were  obtained  on  a global  basis.  That  satellite  carried  two 
instruments  that  obtained  infrared  measurements  at  relatively  high  spectral  resolution:  the  satellite  infrared  spec- 
trometer (SIRS)  [Work  and  HUleary,  1969]  and  the  infrared  interferometer  spectrometer  (IRIS)  [Hanel  and  Conrath. 
1969] . The  Nimbi's  4 satellite  launched  April  9, 1970,  carried  improved  versions  of  SIRS  and  IRIS  [Hanel  and 
Conrath,  1970]  as  well  as  a selective  chopper  radiometer  (SCR).  Thus,  a large  body  of  data  now  exists  that  can  be 
used  in,  and  has  stimulated,  the  development  of  temperature  profile  retrieval  techniques. 


•NAS/NRC  Resident  Research  Associate.  Present  address:  Centre  National  d'Etudes  des 
Telecommunications,  Paris,  France. 
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A theoretical  expression  for  the  radiance  / (v)  at  wavenumber  v measured  by  a satellite-borne  instrument 
can  be  obtained  from  a solution  of  the  radiative  transfer  equation  written  in  integral  form 


Ps 

J(v)  = B(v,Ts)t{v.ps)  - (b[v,T(p)] 


?! <EgI  dp 
ap 


(1) 


where  B(v,T)  is  the  Flanck  function  at  wavenumber 
v and  temperature  T.  The  atmospheric  pressure  p has 
been  used  here  as  the  independent  height  variable,  but 
any  monotonic  function  of  p can  be  used  with  In  p 
frequently  being  found  convenient.  The  atmospheric 
transmittance  from  level  p to  the  top  of  the  atmos- 
phere for  a spectral  interval  centered  at  wavenumber 
v is  denoted  by  r(i >,p),  and  Ts  and  ps  refer  to 
surface  temperature  and  atmospheric  pressure  at  the 
planetary  surface,  respectively.  In  deriving  (1),  it  has 
been  assumed  that  the  atmosphere  is  nonscattering  and 
in  local  thermodynamic  equilibrium,  and  that  the  sur- 
face radiates  as  a black  body. 

The  first  term  on  the  tight-hand  s.de  of  (1)  rep- 
resents the  contribution  from  the  planetary  surface, 
while  the  second  term  is  the  atmospheric  contribution. 

For  sufficiently  transparent  spectral  intervals,  the 
boundary  term  must  be  specified  in  some  manner, 
using  measurements  in  the  most  transparent  parts  of 
the  spectrum.  It  will  be  assumed  that  this  can  be  done 
satisfactorily,  and  the  boundary  term  in  (1)  will  not  be 
considered  further  here.  Given  measurements  of  I(v), 
the  basic  problem  then  is  to  solve  (1)  to  obtain  an 
estimate  of  the  vertical  temperature  profile  Tip). 

To  be  able  to  solve  for  Tip ) , it  is  necessary  to 
know  the  atmospheric  transmittance  r(v,p).  There- 
fore, an  absorption  band  for  an  atmospheric  constituent 
of  known  abundance  must  be  employed  so  that  from  a 
knowledge  of  the  gaseous  absorption  coefficients,  it  is  possible  to  calculate  the  atmospheric  transmittance.  Both 
CO 2 with  absorption  bands  in  the  infrared  and  02,  which  absorbs  in  the  microwave  region  of  the  spectrum, 
satisfy  this  requirement.  Inspection  of  (1)  indicates  that  the  vertical  derivatives  of  the  transmittance,  or  so-called 
“weighting  functions.”  essentially  define  those  portions  of  the  atmosphere  sampled  by  measurements  at  various 
points  in  the  spectrum.  Examples  of  weighting  functions  for  2.8  cm-1  wide  spectral  intervals  in  the  15  pm  C02 
band  are  shown  in  figure  1. 

The  techniques  developed  for  the  solution  of  (1)  can  be  grouped  roughly  into  two  broad  categories:  statistical 
and  nonstatistical.  This  discussion  ir>  confined  to  techniques  that  do  not  utilize  extensive  a priori  statistical  informa- 
tion about  the  atmospheric  profiles. 

The  basic  inversion  problem  is  examined  fust,  and  three  methods  of  solution  typical  of  those  commonly  used 
today  are  discussed.  Intercomparisons  of  the  methods  are  made  with  data  in  the  1 5 pm  C02  band  obtained  with  the 
Nimbus  4 IRIS  instrument. 


Figure  1.-  Weighting  functions  for  the  IS  CO,  absorption 
band.  The  absolute  values  of  the  derivatives  of  the  transmit- 
tance with  respect  to  the  logarithm  of  the  atmospheric 
pressure  are  shown  for  several  spectral  intervals  2.8  cm-1 
wide. 
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METHODS  OF  SOLUTION 


In  general,  measurements  of  I(v)  are  available  for  some  finite  number  of  spectral  intervals.  If  quantities 
pertaining  to  the  spectral  interval  centered  on  wavenumber  u-  are  identified  with  the  subscript  i,  then  (1)  reduces 
to  a set  of  m equations 


'«  = 


t 

-iBilTip)] 


dp 


dp 


i = 1,2 


(2) 


In  the  form  given  in  (2),  the  equations  to  be  solved  are  nonlinear  in  the  unknown  T(p).  Most  methods  of  solution 
begin  by  linearizing  in  some  way.  (An  exception  to  this  approach  is  discussed  later.)  One  form  of  linearization 
employs  an  expansion  of  B[v,T(p)\  about  a reference  profile  T^°\p)  -that  is 


Biim)  * Bi[T<0\p)]  + 


d-‘[7^r-— 1 [7tp)_7 <o)(p)] 
d T 


(3) 


Substitution  of  (3)  into  (2)  results  in  a linearized  set  of  equations  of  the  form 


A/, 


d Bj  [ T^°\p )] 
df 


A r(p)  dp 


(4) 


where  AT(p)  = T(p)  - T^°\p ) and  A/(  = /{ - with  if0^  being  the  radiance  calculated  using  T^°\p). 
Approximating  the  integrals  by  numerical  quadrature,  (4)  becomes 


where 


n 

A//  = ^ Kjj  A Tj  i = 1,2, ...  ,m 

7=1 


(5) 


„ iBi[T<°)(Pi))  9 t,(p,) 

K::  = 

v dr  dp  l 


(6) 


with  Wj  an  appropriate  quadrature  weight.  Other  forms  of  linearization  have  also  '■  used  [Work  and  Fleming, 
1966;  Rodgen,  1966;  19701. 

When  linearization  is  employed,  the  problem  is  formally  reduced  to  that  of  finding  solutions  to  the  linear  set 
of  equations  (5).  In  the  overdetermined  case  (m  > n),  a least  squares  solution  can  be  written  in  vector  form 


AT  = (KTK)~lKTM 


(7) 
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where  AT  is  the  solution  estimate  and  the  superscript  T denotes  matrix  transposition.  In  the  atmospheric  profile 
inversion  problem,  there  will  be  generally  many  more  quadrature  points  employed  in  (5)  than  measurements.  In  this 
underdetermined  case  (m  <«),  the  family  of  solutions  satisfying  (5)  can  be  written  [Bjerhammar,  1951  ] 


AT  = KT(KKr)~l  AI  + [KT(KKT)~lK- l]b 


(8) 


where  b is  an  arbitrary  vector.  With  the  additional  constraint  that  AT*  AT  be  a minimum,  it  can  be  shown  that 
the  resulting  particular  solution  corresponds  to  b - 0 in  (8),  or 


AT  = KT(KKTyl  A I 


(9) 


which  is  the  pseudoinverse  of  Penrose  [Greville,  1959]  for  the  underdetermined  case. 

In  practice,  solutions  (7)  and  (9)  are  of  little  value  for,  the  atmospheric  profile  inverse  problem.  The  presence 
of  rn'ise  in  the  measured  radiances  renders  the  solutions  physically  meaningless.  This  is  due  to  the  high  correlation 
an,  the  measurements,  and  the  resulting  tendency  toward  redundancy  of  the  equation  set  (5). 

If  an  ensemble  of  measurements  is  considered,  the  covariance  matrix  for  the  measurements  R can  be  written 

R = <AI  AIT>  = KSKT  (10) 


where  the  angular  brackets  denote  ensemble  averaging,  and  S is  the  covariance  matrix  for  the  temperature  profile, 

t 

S = <AT  ATT>  (11) 


The  second  form  of  R in  (10)  is  obtained  using  (5).  Now  the  correlation  in  the  measurements  is  due  both  to  the 
correlation  of  the  atmospheric  temperature  among  various  atmospheric  levels  and  the  radiative  transfer  process 
itself- that  is,  the  overlap  of  the  weighting  functions.  The  correlation  due  to  the  overlap  of  the  weighting  functions 
alone  can  be  considered  by  assuming  S = o j 1 where  a j is  the  variance  in  temperature  assumed  the  same  at  each 
level.  This  results  in 


R = OjKKT  (12) 

T 

Thus,  the  matrix  KK  provides  a measure  of  the  correlation  among  the  measurements  due  to  the  overlap  of  the 
weighting  functions.  It  is  possible  to  find  a representation  for  the  “measurement  vector”  A I in  which  its  com- 
ponents are  uncorrelated  by  diagonalizing  KKJ . The  resulting  normalized  eigenvectors  u;  given  by 

= Xj-Uj-  i = 1,2,  ...,m  (13) 

provide  a convenient  basis  set  for  representing  AI  in  the  measurement  space  and  will  be  used  in  the 
subsequent  analysis. 
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The  effects  of  noise  on  (9)  will  now  be  considered.  Assume  the  measured  value  AI  consists  of  a “true”  value 
A 1,  plus  an  error  vector  e -that  is 


Al  = Al  + e 


(14) 


The  noise-contaminated  solution  calculated  from  Eq.  (9j  is 


AT  = KT(KKT)-1  AI 


(15) 


Now  a!  can  be  expanded  in  terms  of  u;- 


m 


AI 


= ^ «/u / a/  = u,  ■ 


Al 


(16) 


r'=l 


Substitution  of  Eq.  (16)  into  Eq.  (15)  gives 


AT 


m 

■ s t 


x. 

i=l  ‘ 


(17) 


Thus,  the  solution  can  be  written  as  a linear  combination  of  the  m vectors  K? u/  . These  vectors  form  an  orthog- 
onal set  but  are  not  normalized,  their  lengths  being  \/XJ\  Normalization  yields  the  set  of  vectors 


v.  = -L*ru. 


I * 1.2 m 


(18) 


A 


They  are  identical  with  the  m eigenvectors  of  the  matrix  K‘K,  which  correspond  to  nonzero  eigenvalues. 
Using  Eqs.  (14)  and  (18),  Eq.  (17)  can  be  rewritten  in  the  form 


2 


u,  • AI  + u,-  • e 


(19) 


and  the  solution  can  be  regarded  :•$  a linear  combination  of  the  vectors  v,  in  the  “profile  space.”  The  vectors  v/ 
were  calculated  for  a set  of  16  spectral  intervals  in  the  15  fun  C02  band.  The  first  eight  of  these  vectors,  along 
with  the  corresponding  eigenvalues,  are  shown  in  figure  2.  it  should  be  noted  that  there  is  a large  dispersion  in  the 
range  of  eigenvalues  corresponding  to  approximately  5 orders  of  magnitude  with  the  more  highly  structured  eigen- 
vectors associated  with  smaller  eigenvalues. 

It  is  convenient  to  write  u,*e  = lei  cos  <j>j  where  lei  is  the  magnitude  of  e,  and  $,■  is  the  angle  between 
e and  the  ith  basis  vector  u,-.  Then  Eq.  (19)  becomes 
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AT  = 


(20) 


|e|  cos  <t>\ 

r 


The  first  .term  of  the  coefficient  of  each  v,-  corresponds 
to  the  signal  portion  while  the  second  term  corresponds 
to  the  noise  portion.  Now  if  the  true  profile  is  relatively 
smooth,  A1  will  be  almost  perpendicular  to  those  u,- 
associated  with  small  eigenvalues  in  the  measurement 
space,  and  the  first  term  in  the  corresponding  coeffi- 
cients in  Eq.  (20)  will  not  be  l'~ge  even  though 
appears  in  the  denominator,  however,  since  the  noise 
vector  will  generally  be  random  in  direction  in  the 
measurement  space,  it  can  have  nonvanishing  com- 
ponents, in  the  direction  of  the  u,  associated  with 
small  eigenvalues,  that  will  be  amplified  by  1/VxJ  ■ 
Thus,  the  coefficients  of  J;ose  v(  in  the  profile  space 
that  contain  finer  structure  will  be  large,  resulting  in 
nonphysical,  large-amp  itude,  fine-scale  structure  in 
the  solutions. 

Detailed  analyses  of  the  infoimation  content  of 
noise  contaminated  measurements  employed  in  obtain- 
ing vertical  atmospheric  profiles  have  been  provided  by 
Mateer  [1965]  and  Twomey  [1965] . Examples  of  the 
gross  instabilities  occurring  when  direct  solutions  are 
employed  have  been  given  in  the  literature  [Work and 
Fleming,  1966 \Conrath,  1968]. 


Figure  2.  - Basis  vectors  employed  in  the  representation  of 
solutions  in  the  profile  space.  The  associated  eigenvalue 
is  shown  in  e ich  case. 


To  produce  physically  meaningful  solutions,  an 

inversion  method  must  be  able  to  suppress  either  explicity  or  implicity  the  contributions  from  those  eigenvectors 
associated  with  the  relatively  small  eigenvalues.  Three  computational  methods  typical  of  those  currently  being 
employed  will  now  be  discussed 


The  first  method  to  be  considered  is  based  on  a form  that  can  be  derived  as  a limiting  case  of  the  statistical 
estimation 


AT  = SKT(KSKT  +ET1  Ai  (21) 

obtained  independently  by  Rodgers  [1966]  and  by  Strand  and  Westerner  [1968  aj)  ] . The  relation  Eq.  (21)  was 
also  given  b>  "outer  [1961]  in  a different  context.  Here  AT 3 ¥- T^19)  where  T is  the  estimated  profile,  T^ 
is  the  mean  of  a representative  ensemble  of  profiles,  5 is  the  two-level  profile  covariance  matrix  obtained  from  the 
same  ensemble,  A*I  = 7- 1^1  where  are  the  radiances  calculated  using  t(°),  and  E is  the  instrumental 
noise  covariance  matrix.  The  limiting  form  of  Eq.  (21)  of  interest  here  is  obtained  by  assuming  S - oj*  1 and 
E*o\  1 ; 


AT  = KT(KKT  + yl)~l  M 


(22) 
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where  i-o\lojl  is  the  ratio  of  noise  variance  to  signal  variance.  This  is  called  the  minimum  information,  or 
maximum  entropy,  solution  by  Foster  [1961] . A similar  algebraic  form  was  derived  by  Twomey  [1963]  from  a 
nonstatistical  point  of  view  based  on  the  work  of  Phillips  [1962] . A similar  form  can  also  be  obtained  from  the 
method  of  regularization  [Glasko  and  Timofeyev,  1968] . 

When  and  7 are  not  precisely  known  from  statistics,  Eq.  (22)  can  be  used  as  the  basis  for  an  iterative 
form,  treating  as  a first-guess  profile.  The  resulting  iterative  relation  is 

T<«+1>  . tW^W^D-'IT-iWI  (23) 

where  the  parenthetical  superscripts  refer  to  the  iteration  order.  Iteration  can  continue  until  the  residuals  I - 1^ 
reach  the  instrumental  noise  level. 

The  behavior  of  this  solution  can  be  examined  by  considering  the  representation  of  the  residuals  on  the  8 th 
iterative  step  in  terms  of  the  eigenvectors  u,- 


m 

7-l(*>=  J 4%  afV  = [T-I<£>]  -u,  (24) 

i-1 

From  Eqs.  (24)  and  (23),  a recursion  relation  for  the  coefficients  aft)  can  be  derived  in  the  form 


,X*+1)  = 


-1 «/<> 

Aj+r 


(25) 


from  which  the  relationship  of  the  coefficients  on  the 


8th  iteration  to  their  initial  values  can  be  obtained 


(26) 


The  mean  square  residual  on  the  8th  iteration,  which  can  be  employed  as  a measure  of  convergence,  can  be  written 


-i(T-iW).(7-.(») 


(27) 


Note  that  for  7 > 0 the  terms  in  Eq.  (27)  will  individually  approach  zero  in  the  limit  8 The  rate  of  the 
reduction  of  each  term  with  increasing  8 depends  on  the  relationship  of  7 to  X,-.  Terms  for  which  7 « A; 
will  decrease  very  rapidly  while  those  for  which  7 » will  decrease  slowly.  Thus,  by  closing  an  appropriate 
value  of  7,  the  portion  of  the  residuals  associated  with  the  larger  eigenvalues  and  valid  information  can  be  reduced 
in  one  or  a few  iterations,  while  that  portion  associated  with  small  eigenvalues  and  noise  will  remain  essentially  un- 
reduced. By  truncating  the  iteration  at  the  proper  mean  square  residual  level,  the  noise  contaminated  modes  assxi- 
ated  with  small  X{-  values  can  be  suppressed. 
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The  solution  on  the  fith  iteration  in  terms  of  the  set  of  basis  vectors  vy  in  the  profile  space  can  be  written 


m 


tW  = x(°)  + 


(28) 


This  expression  can  be  obtained  from  Eqs.  (23)  and  (24)  by  straightforward  algebraic  manipulation.  It  can  be  seen 
that  the  coefficients  of  those  Vy  for  which  y » Xy  will  increase  very  slowly,  and  these  terms  will  not  contribute 
to  the  solution  until  many  iterations  have  been  performed.  In  the  limit  2 ■+  <»  .the  factor  in  square  brackets 
approaches  unity  for  all  y > 0 and  Eq.  (28)  reduces  to  an  expression  equivalent  to  Eq.  (19),  indicating  that  the 
iterative  algorithm  Eq.  (23)  converges  to  the  Penrose  pseudo  inverse.  In  practice,  the  iteration  is  stopped  when  the 
residuals  reach  the  instrumental  noise  level. 


Another  type  of  iterative  algorithm  can  be  obtained  in  the  following  way.  Multiplying  both  sides  of  Eq.  (5) 
by  gives 


«r> 

a 1 AI  = KJK  AT 


(26) 


Now  consider  K‘ K written  as  the  sum  of  a purely  diagonal  matrix  G and  a remainder  R ; that  is 


KTK  = G + R 


(27) 


Once  G is  specified,  R is  determined  by  Eq.  (27).  Combining  Eq.  (27)  with  Eq.  (26)  and  rearranging  gives 


G AT  = KTbl-R  AT 


(28) 


This  expression  can  serve  as  the  basis  of  an  iterative  algorithm 


at<8+1)  = G~lKT Al  - g-1£at(£) 


(29) 


where  AT^£+^  = and  AT^  = - T^.  Using  Eq.  (27)  to  eliminate  R finally  results 

in 


t(£+D  = TW  + g~'kt{T-iW) 


(30) 


The  choice  of  G determines  the  covergence  behavior  of  Eq.  (30).  Examples  of  solutions  are  given  in  the  following 
section  with  the  r.  diagonal  element  of  G set  equal  to 


m 

S 


kr-  1 


I 


KjkH 


i = 1 ,2, ....  n . 
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A third  example  of  a nonstatistical  solution  is  that  suggested  by  Chahine  [1968] . It  is  a relaxation  technique 
that  does  not  require  linearization  of  the  radiative  transfer  equation,  but  user  Gq.  (2)  directly.  In  this  approach,  the 
radiance  from  the  r'th  spectral  interval  is  paired  with  the  temperature  at  some  level  p,-.  The  algorithm  employed  is 

[r<£+1  Vf)j  - ~ *f[r(%)|  i = 1 ,2, ....  m (31) 

The  superscripts  refer  to  the  iteration  number,  and  if®  is  the  radiance  calculated  using  the  estimate  for  the  tem- 
perature profile  on  the  6th  iteration.  A first  guess  is  made  at  a temperature  profile  to  begin  the  iteration.  In  order 
to  calculate  if®  at  each  iterative  step,  the  temperatures  T^®(pj)  at  the  m atmospheric  levels  must  be  interpo- 
lated to  intermediate  levels.  The  form  of  interpolation  used,  along  with  the  quadrature  method  employed,  essen- 
tially defines  the  representation  for  the  temperature  profile. 

Iteration  is  carried  out  using  Eq.  (31)  until  some  condition  on  the  residuals  (differences  between  measured 
and  calculated  radiances)  is  met.  For  example,  iteration  may  be  stbpped  when  the  rms  residual  approaches  some 
value  determined  from  a knowledge  of  the  rms  noise  lei  el  of  the  measurements.  The  convergence  properties  of  Eq. 

(3 1 ) are  rather  obscure , and  the  degree  of  success  of  the  method  must  be  judged  empirically . There  is  obviously  no 
unique  choice  of  atmospheric  level  p,-  for  a given  spectral  interval,  but  a logical  choice  would  appear  to  be  a pressure 
level  near  the  peak  of  the  weighting  function  for  that  interval. 

A modification  of  Chahine’s  algorithm  has  been  given  by  Smith  [1970] . This  approach  employes  an  iterative 
equation 

B,(e+,)<p)  = Bf®  + ~ if®\  = 1,2 m (32) 

By  solving  the  Planck  functions  obtained  with  this  equation  for  temperature,  m different  temperature  profiles 

A px  « 

Tj  (p)  (/  = 1,2, ... , m)  are  obtained.  The  solution  is  then  defined  as  the  weighted  mean 


7<«+1)(p)  = 


*f(p)  7**+1>(p) 
i=l 


I 

/=! 


(33) 


A first  guess  at  a temperature  profile  is  made,  and  Eqs.  (32)  and  (33)  are  iterated  until  some  criterion  on  the  radiance 
residuals  is  satisfied.  This  method  does  not  require  an  arbitrary  pairing  cf  atmospheric  levels  with  spectral  intervals 
nor  does  it  require  an  explicit  interpolation  between  levels  since  a temperature  can  be  calculated  at  each  level  using 
Eq.  (33). 
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EMPIRICAL  COMPARISONS  OF  METHODS 


The  behavior  of  three  of  the  methods  discussed  above  are  now  examined  and  compared,  using  applications  to 
actual  satellite  data.  These  methods  include  the  minimum  information  estimation  Eq.  (23),  Chahine's  algorithm  Eq. 
(31),  and  the  iterative  algorithm  Eq.  (30). 

The  data  were  obtained  with  tht  Nimbus  4 IRIS  experiment  [Hanel  and  Conrath,  1970] . The  instrument  has 
been  described  in  detail  elsewhere  [ Hanel  et  at.,  1971) . The  spectral  range  extends  from  400  cm-1  to  1 500  cm-1 , 
and  the  spectral  resolution  is  equivalent  to  about  2.8  cm-1 . Selected  spectral  intervals  in  the  15  jem  (667  cm-1 ) 
CO]  band  are  used  in  the  present  study.  The  noise  equivalent  radiance  in  this  spectral  region  is  approximately  0.5 
erg/cm1  sec  sr  cm-1  (1%  of  the  signal).  The  quality  of  the  solutions  is  established  by  comparisons  with  in  situ 
measurements  obtained  with  radiosondes. 


Examples  of  Retrieved  Profiles 

The  first  examples  to  be  considered  are  solutions  obtained  with  each  of  the  three  techniques,  compared  with 
a radiosonde  profile.  Sixteen  spectral  intervals  were  employed;  weighting  functions  for  several  of  the  intervals  are 
shewn  in  figure  1.  Figure  3 shows  the  retrievals  using  the  minimum  information  method,  the  iterative  estimation 
method  Eq.  (30),  and  the  Chahine  algorithm.  In  each  case,  the  same  first  guess,  a climatological  mean  profile,  was 
used. 


The  minimum  information  solution  and  the  iterative  method  are  found  to  give  almost  identical  results  in  this 
case:  they  are  indistinguishable  on  the  scale  of  figure  3.  The  minimum  information  method  was  iterated  twice  with 
y = G.03(oK)3/(erg/cmJ  sec  sr  cm-1  )J  to  reduce  the  residuals  to  the  instrumental  noise  level.  The  temperature 
dependence  of  K was  taken  into  account  on  each  iteration.  The  solution  shown  in  figur;  3,  using  the  iterative 
method  Eq.  (30),  was  obtained  with  24  iterations,  but  is  essentially  no  different  from  that  obtained  with  only  five 
iterations. 

In  obtaining  the  retrieval  shown  in  figure  3 with  the  Chahine  algorithm,  linur  interpolation  was  employed 
between  the  16  levels  for  which  temperatures  could  be 
computed.  The  appearance  of  spurious  fine  structure 
in  the  solution  is  a problem  encountered  when  an 
attempt  is  made  to  use  a large  number  of  spectral 
intervals,  as  in  this  case.  This  structure  is  essentially 
the  onset  of  instability  and  grows  in  amplitude  as  the 
number  of  iterations  is  increased.  The  use  of  16 
spectral  intervals  results  in  a large  amount  of  redun- 
dancy with  weighting  function  peaks  separated  only 
slightly  in  many  cases.  This  makes  the  selection  of 
levels  for  calculating  temperatures  using  Eq.  (31)  a 
rather  difficult  task.  With  so  many  spectral  intervals, 
the  choice  of  criteria  for  truncating  the  iteration  be- 
comes critical.  In  studies  with  the  Chahine  method 
the  iteration  was  stopped  whenever  the  rms  residual 
dropped  below  0.8  erg/cm3  sec  sr  cm-1  or  changed 
by  less  than  0.02  erg/cm2  sec  sr  cm-1  from  the 
previous  iteration.  The  instability  problem  disappears 
when  the  number  of  spectral  intervals  is  reduced,  ai.d 
the  use  of  7 intervals  produces  better  results  than  16. 


Figure  3.  - Comparison  of  retrieval  using  three  methods.  The 
solutions  obtained  with  the  minimum  information  estima- 
tion and  the  iterative  method  are  Indistinguishable  c i 
this  scale. 
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RMS  RESIDUALS  VS. 
ITERATION  NUMBER 
C MARINE  ALGORITHM 
|1(  SPECTRAL  INTERVALS! 


Figure  4.-  Convergence  behavior  of  the  Chahine  algorithm.  The 
five  examples  shown  were  selected  from  a larger  sample  to  show 
the  range  of  behavior. 


RMS  RESIOUALS 
VS. 

ITERATION  NUMBER 
ITERATIVE  METHOD 


Figure  5.  Convergence  behavior  of  the  iterative  method.  The 
same  five  cases  employed  in  figure  4 are  shown. 


Figure  6.-  Minimum  information  estimation  retrievals  illustrating 
dependence  of  solutions  on  the  first  guess.  The  isothermal 
first  guess  was  25(f  K. 


Convergence  Behavior 

It  was  found  empirically  that  iterating  the  mini- 
mum information  estimation  more  than  twice  did  not 
appreciably  improve  the  solutions  Hence,  this  method 
can  be  done  with  considerable  computational  speed. 

The  convergence  behavior  of  the  Chahine  algo- 
rithm is  snown  in  figure  4 for  five  typical  cases.  The 
rms  residual  is  plotted  versus  iteration  number.  The 
residual  foi  the  Oth  iteration  represents  that  obtained 
with  the  first-guess  temperature  profile.  In  general, 
all  the  curves  are  characterized  by  a large  deciease  in 
the  residual  on  the  first  iteration,  followed  by  a region 
of  less  rapidly  decreasing  residual  until  the  curves  be- 
come nearly  flat.  In  some  cases  the  residuals  actually 
increase  again  after  going  through  rather  broad  minima. 
There  is  considerable  dispersion  in  the  asymptotic  be- 
havior of  the  curves,  which  appears  to  depend  on  the 
nature  of  the  profile,  the  first  guess,  and  the  behavior 
of  the  noise  in  each  case. 

The  best  results  are  obtained  when  the  iteration 
is  stopped  at  some  point  where  the  residuals  are  begin- 
ning to  decrease  less  rapidly  but  have  not  yet  reached 
the  flat  part  of  the  curve.  However,  the  optimum 
point  for  stopping  the  iteration  appears  to  vary  from 
case  to  case,  making  it  difficult  to  establish  a general 
set  of  convergence  criteria.  The  quality  of  a solution 
is  found  to  decrease  rapidl  s the  flat  part  of  the 
curve  is  approached. 

The  change  in  tns  residual  with  iteration  number 
for  the  iterative  method  is  shown  in  figure  5 for  the 
same  five  cases.  The  general  shape  of  the  curves  is 
qualitatively  similar  to  those  obtained  with  the 
Chahine  algorithm  but  with  a somewhat  more  abrupt 
transition  to  the  flat  portion  of  the  curve.  The  solu- 
tions were  found  not  to  change  significantly  after 
iteration  4 or  5,  and  the  quality  remained  good  out 
through  iteration  24,  which  was  as  far  as  the  calcula- 
tions were  carried.  Thus,  the  criteria  for  stopping  the 
iteration  are  not  critical  with  this  method. 

Dependence  of  Solutions  on  the  First  Guess 

Since  ail  three  solutions  require  a first-guess 
temperature  profile,  it  is  of  interest  to  investigate  ‘he 
sensitivity  of  the  final  solutions  to  the  first  guess.  For 
this  purpose,  a case  was  chosen  in  the  tropics  where 
the  temperature  profile  has  a very  sharp  minimum 
(tropopause).  This  represents  essentially  fine  struc- 
ture and  as  such  is  generally  difficult  to  retrieve 
accurately. 
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Solutions  were  obtained  w.tii  each  of  the  three  methods  using  two  extremes  in  first  guess:  (1)  a climatological 
mean  temperature  profile  having  nearly  the  same  shape  as  the  actual  profile,  and  (2)  an  isothermal  profile  at  250°  K. 
Figure  6 shows  the  results  of  this  test  for  the  minimum  information  estimation  Eq.  (23),  with  the  iterative  method 
Eq.  (30)  and  Chahine's  algorithm  Eq.  (31)  giving  essentially  similar  results. 

Good  iesults  were  obtained  with  all  three  methods  using  the  climatological  first  guess.  With  the  isothermal 
first  guess,  the  gross  structure  of  the  profile  is  retrieved.  The  essentially  linear  part  of  the  profile  in  the  lower  atmos- 
phere (troposphere)  is  fit  quite  accurately.  However,  the  solutions  are  about  10°  K too  warm  at  the  tropopanse, 
and  th,'  shaip  structure  is  not  retrieved. 


SUMMARY 


The  mathematical  techniques  employed  in  obtaining  atmospheric  profiles  have  undergone  extensive  evolution 
since  the  early  attempts  at  direct  matrix  inversion  which  resulted  in  highly  unstable  solutions.  Various  methods 
have  been  developed  for  obtaining  stable  solutions  which  are  physically  meaningful.  These  solutions  generally 
employ  constraints,  either  implicity  or  explicitv,  that  may  involve  the  use  of  extensive  information  on  the  statistical 
properties  of  the  profile  or  may  use  only  a knowledge  of  the  general  properties  of  the  profile  (for  example- its 
smoothness).  Both  general  approaches  have  recently  been  employed  in  the  reduction  of  infrared  sp^ctrr'  data 
acquired  from  satellites  [Smithetal.,  1970;  Wark,  1970;  Conrath  et  al.,  1970].  The  present  review  has  been 
limited  to  techniques  that  do  not  use  extensive  statistical  information. 

Constrained  forms  of  matrix  inversion  have  been  developed  through  the  work  of  Philli|.  .1  Twomey , and 

more  recently  the  so-called  “minimum  information  estimation,”  closely  related  to  the  solutions  given  by  Twomey, 
has  been  used.  The  minimum  information  solution  can  be  viewed  as  either  a limiting  form  of  a statistical  estimation 
or  a constrained  form  of  pseudoinversion  for  an  underdetermined  system.  Several  iterative  techniques  have  been 
developed,  including  that  of  Chahine,  which  uses  ratios  of  measured  to  calculated  radiances,  and  other  methods 
using  differences  between  measured  and  calculated  radiances. 

Three  methods  typical  of  those  currently  in  use  were  described  and  compared  using  actual  satellite  data  The 
performance  of  the  methods  also  was  compared  against  in  situ  measurements  obtained  with  radiosondes.  It  should 
be  emphasized  that  the  empirical  results  obtained  here  pertain  to  the  IS  u m C02  band  and  could  conceivably  be 
altered  if  other  spectral  bands  were  considered. 

When  large  quantities  of  data  are  to  be  reduced  for  the  terrestrial  atmosphere  on  a global  basis,  computational 
speed  is  of  utmost  importance.  The  minimum  information  estimation  is  rapid,  and  therefore  attractive  for  this 
application,  because  it  lequnes  few  iterations.  Increasing  the  number  of  spectral  intervals  increases  the  compua- 
tional  time  tequired  for  all  methods,  and  this  factor  must  be  weighed  against  any  possible  gain  through  iedundancy. 

Study  of  the  covergence  behavior  of  the  Chahine  algorithm  indicates  that  the  choice  of  criteria  for  truncating 
the  iteration  is  very  important.  For  the  iterative  method  Eq.  (30),  the  results  are  insensitive  to  the  number  of 
iterations  within  a rather  broad  range.  A study  of  the  dependence  of  the  methods  on  the  first  guess  indicates  that 
only  the  gross  features  of  the  profile  can  be  retrieved  if  a poor  first  guess  is  employed,  such  as  an  isothermal  atmos- 
phere. This  aspect  is  cf  particular  interest  when  the  .application  of  the  techniques  to  spectra  from  other  planets  is 
considered.  An  example  of  such  an  application  is  the  retrieval  of  profiles  for  the  Martian  atmosphere  from  spectra 
to  be  obtained  with  the  IRIS  instruments  flown  on  the  1971  Mariner  missions  [ Hanel  et  al.,  1970] . 

In  this  review  of  the  mathematics  of  atmospheric  profile  inversion,  several  factors  in  reducing  data  for  the 
earth's  atmosphere  on  a global  basis  hive  not  been  discussed.  Chief  among  these  is  the  problem  of  a partially  c'oud- 
fllled  instrumental  field  cf  view.  In  this  situation,  the  radiances  must  be  corrected  so  the  cloud  effects  are  removed 
before  the  techniques  described  can  be  applied.  The  correction  is  accomplished  using  information  from  c Lier  spec- 
tral intervals  or  information  derived  from  spatial  scanning  [Smith,  1967;  Smith  etaL,  1970;  Rodgers,  1970] . 
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While  c .sly  the  retrieval  of  temperature  profiles  has  been  considered  in  detail  here,  the  same  mathematical 
techniques  can  be  employed  to  retrieve  the  vertical  profiles  of  optically  active  gases.  Once  the  temperature  profile 
has  been  obtained  from  measurements  in  the  absorption  band  of  a gas  whose  abundance  is  known,  measurements  in 
absorption  bands  of  other  gases  such  as  water  vapor  and  ozone  yield  information  on  those  gases.  Adaptation  of 
techniques,  such  as  those  described  here,  to  the  problem  of  retrieving  information  on  atmospheric  constituents  has 
been  described  in  the  literature  [Conrath,  1969;  Prabhakara,  1969;  Smith,  1970]. 
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DISCUSSION 


Strand:  I’d  like  to  ask  about  the  strong  dependence  of  the  initial  guess.  Why  do  you  lose  all  this  information  and 

never  get  it  back  when  you  start  with  a poor  initial  guess? 

Conruth:  The  structure,  like  the  tropical  tropopause.  is  not  contained  in  your  measurements.  You  need  an  initial 

guess  containing  this. 

Waters:  In  applying  this  technique  to  atmospheres  other  than  that  of  the  earth,  how  well  do  you  know  the  kernel 

in  the  integrand? 

Conrath:  You  are  on  shakier  ground.  First,  you  don’t  know  the  mixing  ratio  precisely.  This  upsets  the  relation- 

ship between  the  optical  depth  and  pressure  level  and  introduces  profile  distortions.  Also  with  low  pressures 
and  high  mixing  ratios  (Mars)  we  are  extrapolating  quite  ? bit  with  our  theoretical  representations  of  the 
absorption  coefficients.  This  bring?  in  other  uncertainties,  too. 

Falcone : If  you  use  the  Bayesian  technique  and  you  put  in  your  a priori  knowledge,  then  you  can  look  at  your 

variance  to  see  how  it  is  changing.  You  will  get  better  knowledge  that  way. 

Conrath : You  essentially  have  no  real  a priori  knowledge.  Ail  you  have  is  the  variance  of  your  measurements. 

Unidentified  speaker:  On  your  curves  you  presented  of  the  rms  radiance  residuals  versus  iteration  number,  have 

you  looked  at  what  your  rms  temperature  errors  do? 

Conruth:  Yes.  For  the  Chahine  type  relaxation  algorithm  if  you  keep  iterating  out  on  the  flat  part  of  the  curve 

the  temperature  errors  go  through  a minimum  and  then  go  up.  For  the  other  method  it  doesn’t  make  a diff- 
erence and  you  get  a very  flat  minimum. 

McOatchey:  From  what  you  have  shown  it  appears  that  the  problem  is  solved.  All  of  the  methods  gave  very  good 

answers.  Does  that  mean  we  can  get  the  temperature  information  we  want? 

Conrath:  That  is  a very  strong  statement  and  l am  reluctant  to  agree.  The  subject  of  cloudy  atmospheres  is  still 

a big  problem. 

Chahine:  I would  like  to  comment  on  the  degree  of  vertical  resolution  we  can  get  from  a given  set  of  measure- 

ments. The  frequencies  used  to  sound  the  tropopause  are  very  broad  and  because  the  tropopause  temperature 
is  lower  than  above  or  below  it,  a large  amount  of  outgoing  radiance  comes  from  the  region  above  and  below 
the  tropopause.  Therefore,  the  inverse  problem  will  not  reconstruct  the  tropopause.  Methods  should  be 
developed  to  measure  outgoing  radiance  with  a high  degree  of  accuracy  and  high  spectral  resolution. 

McOatchey:  I have  generated  from  a series  of  real  atmospheric  temperature  soundings  radiance  distributions  for 

both  the  15p  and  cases  tor  every  two  wavenumbers  in  each  band  and  using  a set  of  transmission  functions 
which  1 would  be  glad  to  piuvide.  1 would  like  someone  to  do  the  inversion  and  I’ll  keep  the  numbers  locked 
up.  This  should  be  a very  good  and  interesting  test  of  various  techniques. 
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ABSTRACT 


Veitical  temperature  profiles  are  derived  from  radiation  measurements  by  inverting  the  integral  equation  of 
radiative  transfer.  Be  can*-  of  the  nonuniqueness  of  the  solution,  the  particular  temperature  profde  obtained  depends 
on  the  numerical  inversion  technique  used  (that  is,  matrix  inversionor  iteration)  and  the  type  f auxiliary  informa- 
tion incorporated  in  the  solution  (that  is,  statistical  or  initial  data).  The  choice  of  an  inversion  algorithm  depends 
on  many  factors;  including  the  speed  and  size  of  computer,  the  availability  of  representative  statistics,  and  the 
accuracy  of  initial  data.  This  paper  presents  results  of  a numerical  study  comparing  two  contrasting  inversion 
methods:  the  statistical-matrix  inversion  method  and  the  nonstatisticai-iterative  method.  These  were  found  to  be 
the  most  applicable  to  the  problem  of  determining  atmospheric  temperature  profiles.  Tradeoffs  between  the  two 
methods  are  discussed. 


INTRODUCTION 


The  determination  of  vertical  temperature  profiles  from  multispectral  satellite  measurements  of  the  earth’s 
radiance,  in  an  absorption  band  of  a known  gaseous  constituent,  requires  the  inversion  of  the  integral  equation  of 
radiative  transfer 


I(v)  = B[v,T{x0)]r{yx0)  - 


B[vT(x)) 


d t(v,x) 

6x 

dx 


0) 


The  spectral  radiance  measured  from  space  is  giver  by  /(v),  B(y,  T)  is  the  Planck  radiance  at  frequency  v and 
temperature  T,x  is  any  single-valued  function  of  pressure  p,  and  r(v,x)  is  the  fractional  transmittance  of  the 
atmosphere  above  the  level  x for  radiation  at  frequency  v.  In  this  discussion  x(p)  = a (Ir.(lOOp)]  2 + 1 
where  a - 99/[ln(100,000)]  . The  zero  subscript  denotes  surface  boundary  values.  The  boundary  term  is  determined 
from  atmospheric  “window”  measurements.  The  solution  T(x)  is  obtained  from  the  Planck  radiance  B [i>,  7Xx)] 
assuming  the  kernel  of  dr/dx  of  Eq.  (1)  is  known.  Figure  1 shows  the  nine  components  of  the  kernel  (or  Planck 
radiance  “weighting  functions”)  for  the  spectral  intervals  of  a future  operational  vertical  sounder.  The  Planck 
radiance  is  given  by 


B(y,T)  * c,v3/[exp(c2v/r)- 1] 


(2) 


where  t'i  and  c2  are  constants  of  the  Planck  function. 
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Fleming  and  Smith  [1971]  give  the  details  of  various  numerical  algorithms  for  solving  Eq.  (1)  for  T(x). 
Briefly,  the  solution  of  Eq.  (1)  requires  measurements  /(v)  to  a certain  degree  of  accuracy  at  a finite  number  of 
frequencies  and  knowledge  of  t(i>,x).  Because  of  the  conservative  nature  of  the  atmosphere,  one  begins  with  an 
initial  profile  for  7'(x)-a  climatological  mean  or  a forecast  profile- that  redLces  the  problem  to  solving  for  the 
temperature  deviation  rather  than  for  its  entire  magnitude.  The  Planck  function  is  then  linearized  with  respect  to 
frequency  about  some  intermediate  reference  frequency. 

The  resulting  matrix-vectorial  approximation  of  the  radiative  transfer  equatioi  is 


r = Ab  + c 


(3) 


where  r is  a column  vector  of  differences  of  observed  radiances  from  radiances  given  by  the  initial  data;  A is  an 
MX  N matrix  whose  elements  are 


aH 


-an 


ir(vj,Xf) 

dx 


•i  = 1 ,...,M 
i = i,...,iV 


(4) 


where  M is  the  nui  >ber  of  measurement  frequencies  and  N is  the  number  of  pressure  levels.  The  elements  of  the 
column  vector  b are 


bj  = B[v0,Tj]  - B\v0,Tj\ 


(5) 


and  the  observational  errors  are  represented  by  the  row  vector  e.  The  Planck  function  linearization  factors  in 
Eq.  (4)  are  given  by 


B(  vi  ,Tj) 

biXj) 

mviJjMtTj 

. dB(v0,Tj)ldTj 


for  Tj  Tj 


for  Tj 


(6) 


where  v0  is  the  reference  frequency  and  Tj  is  the  initial 
temperature  at  the  ;th  pressure  level.  Since  Tj  in  Eq. 
(6)  is  unknown,  Ts  must  be  used  initially,  but  on  succes- 
sive iterations  the  latest  estimate  of  Tj  can  be  used  in 
estimate  o ■■ . The  inverse  problem  is  now  reduced  to 
solving  Eq.  v.3)  for  b since  the  temperature  profile  T can 
be  obtained  from  the  unique  inverse  of  the  Planck  func- 
tion Eq.  (2). 


Equation  (3)  cannot  be  solved  directly  for  b in  that 
it  is  unstable  because  of  the  near  singularity  of  the  matrix 
A.  The  near  singularity  is  due  to  the  lack  of  vertical  in- 
dependence of  the  weighting  functions  (fig.  1).  Numerous 
indirect  methods  of  solving  Eq.  (3)  have  been  presented 


Figure  1.-  Derivative  of  transmittance  (weighting  functions) 
versus  p rewire  for  the  spectral  wave  number!  indicated. 
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in  the  literature.  However,  since  the  solution  of  Eq.  (3)  is  nonunique,  the  solution  one  obtains  depends 
on  the  method  used. 

METHODS 

Two  very  different  methods  of  solving  Eq.  (3)  for  b are  given  in  Fleming  and  Smith  [1971]:  the  statistical 
matrix  inversion  method  and  the  nonstatistical  iterative  method.  The  statistical  method  yields  the  scluiicn 


b * C r 


(7) 


in  which  the  coefficient  matrix 


C = Sb  Ar[A  Sb  Ar  + Scr‘  (8) 

where  and  Se  are  covariance  matrices  of  b and  e.  The  superscripts  T and  -1  denote  the  matrix  transpose 
and  matrix  inverse,  respectively.  In  Eq.  (7)  the  elements  of  b are  the  deviations  of  the  actual  temperature  from  the 
mean  temperature  values  obtained  from  the  statistical  sample  of  soundings  used  to  estimate  , and  r is  the  vector 
of  deviations  of  the  measured  radiances  from  the  radiances  corresponding  to  the  mean  temperature  profile. 

The  general  iterative  method  utilizes  tike  solution 

b(n+l)  = cr<n)  (9) 


where  the  parenthetical  superscripts  (n)  represent  the  nth  step  in  the  iterative  procedure.  The  coefficient  matrix 

C = D KT  (10) 

where  D is  a diagonal  matrix  with  diagonal  elements 


In  Eq.  (9)  b and  r are  deviations  of  atmospheric  temperature  and  measured  radiances  as  in  the  previous  paragraph, 
but  now  the  deviations  are  from  the  values  calculated  in  the  previous  step  of  the  iteration. 

Basic  differences  in  the  two  methods  of  solution  are  readily  apparent  from  a comparison  of  the  coefficient 
matrices  given  in  Eqs.  (8)  and  (10).  The  iterative  method  avoids  the  calculation  of  an  inverse  matrix,  but  at  the  ex- 
pense of  iteration.  On  the  other  hand,  the  statistical  method  utilizes  statistical  covariance  matrices  and  an  inverse 
matrix  to  achieve  a one-step  solution  for  the  departure  of  atmospheric  temperature  from  a fixed  mean  profile.  The 
iterative  method  is  free  to  use  an  arbitrary  initial  profile  and  attempts  to  successively  improve  the  solution  through 
iteration.  Also,  this  method  is  easier  to  implement  than  the  statistical  method  since  a priori  statistics  are  not  required. 
However,  the  iterative  method  is  more  time  consuming  than  the  statistical  method  due  to  the  number  of  iterations 
required. 
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RESULTS 


The  results  presented  here  were  obtained  from 
calculations  previously  described  by  Fleming  and  Smith, 
[1971]  in  their  numerical  comparison  of  the  two  inver- 
sion methods.  The  procedure  was  essentially  as  follows. 
Radiances  were  calculated  by  Eq.  (1)  from  a diverse 
sample  of  109  tempe'ature  profiles  between  15°N  and 
90°N  latitude.  Random  noise,  distributed  normally  with 
a standard  deviation  of  0.2  erg/ (cm2  sec  sr  cm-1 ),  was 
added  to  the  radiances  to  simulate  real  measurements. 
The  109  profiles  were  then  retrie'  ed  from  the  perturbed 
radiances  by  the  statistical  and  the  general  iterative 
methods,  Eqs.  (7)  and  (9). 

Covariance  matrices  and  ‘heir  corresponding 
mean  profiles  T were  determined  for  four  latitudinal 
zones- 15-30°N,  3045°N,  4S-60°N,  60-90°N-in  an  in- 
dependent study  that  did  not  utilize  any  of  the  109 
profiles  used  in  the  present  study.  The  mean  profiles 
used  in  the  statistical  method  were  also  used  as  initial 
profiles  in  the  iterative  method  for  consistency.  The 
noise  covariance  matrix  Se  was  simplified  to  the 
diagonal  form  o*I  with  a£  = 0.2.  The  covergence 
criteria  of  the  iterative  method  was  based  on  a£  = 0.') 
as  an  upper  bound. 


WEAN  ABSOLUTE  TEMPERATURE  ERROR  C 

Figure  2.-  Mean  absolute  temperature  errors  as  a function  of 
pressure  for  S-9(f  N.  Solid  line  and  dashed  line  correspond 
to  the  twr  solutions  indicated  by  the  numbers. 


Figure  2 shows  the  mean  absolute  temperature  error  associated  with  the  two  methods  as  a function  of  pressure. 
The  accuracy  of  both  methods  is  relatively  poor  in  the  layers  of  the  atmosphere  where  abrupt  changes  in  lapse  rate 
exist;  for  instance,  below  700  mb  where  surface  inversions  occur,  within  the  300  to  70  mb  layer  where  the  tropopause 
prevails,  and  in  the  upper  stratosphere  where  the  stratopause  temperature  reversal  exists.  In  the  layers  where  constant 
lapse  rates  of  temperature  generally  prevail  the  accuracies  of  both  methods  is  dose  to  1°C  even  though  the  simulated 
radiances  were  contaminated  with  realistic  noise.  The  general  saw-toothed  nature  of  the  curves  is  related  to  the  fact 
that  individual  members  of  the  set  of  109  soundings  contain  sharp  temperature  inversions  that  occur  randomly  in 
layers  having  thicknesses  below  the  resolution  provided  by  the  weighting  functions. 


Figure  3 shows  the  average  difference  of  the  abso- 
lute errors  of  the  two  methods  for  four  latitude  belts 
within  the  1 5-90°N  region.  The  error  difference  was 
computed  by  subtracting  the  average  absolute  error  of 
the  statistical  solution  from  the  average  absolute  error  of 
the  nonstatistical  method  for  each  latitude  region.  Con- 
sequently, layers  of  positive  difference  are  layers  in  which 
the  statistical  solution  is  superior  to  the  nonstatistical 
solution. 

In  viewing  the  difference  curves  for  the  four  lati- 
tude belts,  one  sees  that  he  only  vertical  layer  possessing 
a consistently  one-sided  difference  is  the  tropopause 
layer  (300  to  70  mb).  In  this  region  of  the  atmosphere 
the  average  accuracy  of  the  statistical  solution  exceeds 


OtMi*|NCt  Of  Aisotut*  |MO«  C 
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Figure  3.-  Difference  of  absolute  error  between  nonstatistical 
and  statistical  methods  for  polar,  subpolar,  midlatitude  and 
subtropical  areas. 
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Figure  4.-  Temperature  profiles  as  a function  of  pressure  for 
White  Horse,  Y.  T.  The  RAOB,  guessed  profile,  and  the  two 
solutions  are  identified  by  the  numbers. 


that  of  the  nonstatisticai  solution  by  as  much  as  1°C. 
(The  upward  progression  of  the  tropopause  toward  lower 
pressures  with  decreasing  latitude  is  quite  evident.) 
Statistics  are  expected  to  improve  the  accuracy  in  this 
layer  because  the  small-scale  tropopause  features  (its 
height  and  change  in  temperature  lapse  rate)  are  known 
to  be  highly  co<  related  with  broader  scale  characteristics, 
such  as  average  lapse  rate  and  vertical  mean  temperature, 
of  the  troposphere  and  stratosphere.  It  is  somewhat 
surprising  that  the  statistics  seem  to  play  an  insignificant 
role  within  other  regions  of  the  atmosphere,  except  in 
the  upfer  polar  stratosphere.  This  indicates  that  smaller 
scale  vertical  features  outside  the  tropopause  layer  are 
not  well  correlated  with  features  that  can  be  resolved  by 
the  radii  nces.  In  fact,  there  are  regions-  for  example 
the  layers  centered  around  10  mb  and  800  mb  levels- 
where.thi:  nonstatisticai  solutions  are  more  accurate  than 
the  statistical  solutions.  The  adverse  role  of  statistics  is 
caused  by  the  occurrence  of  statistically  anamolous 
temperature  profile  features. 


fhe  role  of  statistics  in  the  temperature  profile 
solution  is  more  dramatically  demonstiated  on  a profile- 
by-profile  basis.  Figures  4 and  S show  the  individual 

profiles  in  which  the  largest  positive  differences  and  the  largest  r?gative  differences,  respectively,  of  figure  3 occurred. 
The  actual  temperature  profile  (denoted  RAOB  for  radiosonde  observation)  shown  in  figure  4 is  characterized  by  a 
very  sharp  and  cold  tropopause.  The  tropopause  layer  is  less  than  100  mb  thick  and  apparently  below  the  resolution 
of  the  radiance  observations  since  the  nonstatisticai  solution  failed  to  exhibit  this  tropopause  feature.  However,  the 

statistical  solution  partially  reveals  the  tropopause  be- 
cause this  feature  commonly  exists  in  temperature  pro- 
files at  this  latitude.  The  accuracy  of  the  two  solutions 
outside  the  tropopause  region  is  nearly  the  same.  Both 
solutions  capture  the  low-level  surface  inversion  and  the 
abrupt  change  in  lapse  rate  near  the  10  mb  level. 

Figure  S shows  a temperature  profile  that  is  statis- 
tically anomalous  for  its  latitude.  In  this  situation  the 
statistical  solution  exhibits  erroneous  tropopause  struc- 
ture as  well  as  an  erroneous  stratopuuse  near  the  2 mb 
level.  This  erroneous  structure  caused  relatively  large 
errors  throughout  the  entire  stratosphere.  The  nonsta- 
tisticai solution,  on  the  other  hand,  is  in  excellent  agree- 
ment with  the  actual  profile  throughout  most  of  the 
atmosphere. 


Figure  5 - Temperature  profiles  u a function  of  pressure  for 
Argentia,  Nfld.  The  RAOB,  guessed  profile,  and  the  two 
solutions  are  identified  by  the  numbers. 


CONCLUSION 


In  comparing  the  statistical  and  nonstatisticai  in- 
version methods,  the  major  conclusion  to  be  drawn  is 
that  statistics  generally  improve  the  retrieval  of  the 
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temperature  profile  in  the  tropopause  region.  In  other  regions  of  the  atmosphere  with  the  exception  of  the  polar 
stratosphere,  statistics  do  not  appear  to  significantly  improve  the  accuracy  of  the  temperature  profile  solution.  In 
statistically  anomalous  situations,  the  use  of  statistics  can  degrade  the  solution  for  the  atmospheric  temperature 
profile. 

The  above  conclusions  are  not  necessarily  valid  when  one  uses  better  initial  profiles  or  a different  number  of 
spectral  radiance  observations  in  the  solution.  In  setting  up  an  operational  program  to  do  temperature  ictrievals,  the 
authors  found  no  significant  improvement  with  statistics  when  using  a 1 2-hr  forecast  as  the  initial  profile.  On  th* 
other  hand,  we  have  also  found  that  statistics  tend  to  play  a more  significant  role  when  only  a few  radiance  obser- 
vations are  available  for  the  temperature  profile  solution. 


i— ...  . 


DISCUSSION 


Chahine:  The  general  iteration  method  is  simply  t nonlinear  ulaxation  method  written  in  vector 

notation.  Concerning  the  minimum  information  method,  or  Twomey  method,  all  methods  that  tend  to 
linearize  the  Planck  function  by  writing  it  as  a function  of  frequency  multiplied  by  a function  of  temperature 
are  accurate  only  if  measurements  are  made  in  a very  narrow  frequency  range  so  that  we  can  take  the  depend- 
ence of  the  Planck  function  on  frequency  as  the  value  at  the  mean  point  between  the  two  extremes.  However 
we  are  looking  into  a complicated  atmosphere  with  clouds,  etc.,  and  we  would  like  to  get  all  possible  informa- 
tion not  only  from  one  frequency  range.  In  that  case  linearization  of  the  Planck  function  would  not  apply. 

Falcone:  In  the  microwave  region  the  problem  is  linear. 

Parry  As  a meteorolhist  1 have  had  experience  in  using  these  data.  Some  have  been  saying  that  the  most  probable 
solution  is  the  best  solution.  If  this  were  the  case  for  operational  purposes,  then  it  would  not  be  necessary  to 
make  observations.  We  could  simply  use  the  climatology  to  make  early  forecasts.  Tht  unusual  situation  is  the 
important  situation. 

Smith:  The  best  solution  is  one  that  satisfies  your  measurements,  given  all  the  other  information  you  know.  This 

additional  information  could  be  contained  in  statistical  covariance  matrices  or  what  is  generated  in  dynamical 
forecast  models. 

King:  Certainly  any  information,  statistical  or  not,  should  be  considered  and  used.  However,  critical  is  the  matter 

of  where  you  use  it.  If  you  feed  the  statistical  information  in  at  the  beginning  of  your  algorithm,  the  results 
are  always  a little  unconvincing  because  you  don't  know  whether  you  are  regurgitating  what  you  put  in  or 
getting  new  information.  There  is  also  the  problem  that  with  the  given  set  of  radiances  there  are  infinitely 
manifold  possible  solutions.  The  problem  is  choosing  among  one  of  many,  and  the  statistical  criterion  has 
been  used  as  the  basis  for  a uniqueness  criterion.  This  is  not  the  only  one.  We  want  some  external  condition. 
Another  external  constraint  would  be  to  choose  from  all  of  these  possible  profiles  the  smoothest  one,  in  the 
geodesic  sense.  This  is  an  equivalent  radiometric  profile,  not  the  actual  profile,  but  it  is  a completely  unbiased 
one.  You  may  then  insert  your  statistical  knowledge  to  correct  that  profile. 
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ABSTRACT 


The  relaxation  method  for  inverse  solution  of  the  full  radiative  transfer  equation  is  generalized  to  solve  foi  all 
the  atmospheric  parameters  that  appear  in  the  integrand  as  functions  or  functionals,  without  any  a priori  information 
about  the  expected  solution.  Illustrations  are  presented  using  the  7.5p  CH4  band  for  determining  temperature  pro- 
files in  the  Jovian  atmosphere,  and  the  6.3/u  band  for  determining  the  water  vapor  mixing  ratio  in  the  earth's 
atmosphere. 


INTRODUCTION 


The  relaxation  method  of  solution  [Chahine,  1968,  1970]  developed  for  the  determination  of  a class  of 
atmospheric  parameters  appearing  mainly  as  functions  in  the  integrand  of  the  radiative  transfer  equation,  such  as 
temperature  and  constant  composition  mixing  ratios,  has  been  generalized  [Chahine,  1971  ] for  the  determination 
of  the  class  of  parameters  that  appear  as  functionals  in  the  integrand-for  example,  variable  composition  structures. 

Here  we  outline  the  general  nature  of  the  relaxation  method,  and  then  conduct  several  numerical  experiments 
for  determining  the  temperature  profile  in  the  atmosphere  of  Jupiter,  and  for  determining  the  water  vapor  mixing 
ratio  profile  in  the  earth’s  atmosphere.  The  results  discussed  here  are  based  in  part  on  Chahine  [1971] . 


THEORETICAL  BACKGROUND 


The  outgoing  radiance  at  a frequency  v,  received  at  level  p of  a plane,  parallel  atmosphere  in  local 
thermodynamic  equilibrium,  is  given  by  the  radiative  transfer  equation  in  its  integral  form  as 


/O')  = B[v,  T(ps)]T(v,ps)  + 


( 

InPs 


lnp 

B[p,nP)) 


br 
3 In  p 


d lnp 


(1) 


assuming  a Planck  blac.'.  body  boundary  condition  at  the  surface  ps  with 
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The  surface  emissivity  is  assumed  to  be  unity,  and  t(»\  (q(p)),  . . . ) is  the  transmittance  of  a column  of 
absorbers  between  levels  p and  p. 

-V  ^ 

From  an  appropriate  set  of  J observed  outgoing  radiance  values  /(»>,•, p),  we  aim  to  determine  the  unknowns, 
such  as  temperature  T(p)  or  composition  <?(p),  by  inverse  solution  of  Eq.  (1).  The  extraction  of  these  unknowns 
from  under  the  sign  of  integration  is  difficult  because  it  requires  the  solution  of  a nonlinear  integral  equation  with 
fixed  limits.  Part  of  the  difficulty  can  be  related  to  the  t „ct  that  the  mathematical  properties  of  the  integrand  of 
Eq.  (1)  precludes  the  reduction  of  this  problem  to  the  inverse  solution  of  a linear  system  of  equations 

£ V/  (2) 

A comparison  between  the  properties  of  Eqs.  (1)  and  (2)  reveals  that  Eq.  (2)  always  has  a solution  for  any 
arbitrary  set  of  given  values  of  7(v,),  while  integral  equations  with  fixed  limits  as  Eq.  (1)  do  not  always  have  a 
solution  if  I(v)  is  arbitrary.  And  in  this  problem  / is  arbitrary,  because  it  is  obtained  from  measurements  that 
include  noise,  thereby  making  the  reduction  of  Eq.  (1)  to  Eq.  (2)  mathematically  improper. 

Furthermore,  the  linear  system  resulting  from  Eq.  (2)  is  always  highly  ill-conditioned;  for  example,  if  the 
unknown  on  the  right-hand  side  of  Eq.  (2)  is  temperature,  the  condition  number  of  matrix  A is  10s  and  107  in  the 
13/i  and  4.3p  C03  bands,  respectively.  Therefore,  no  temperature  profile  can  be  determined  from  the  solution  of 
such  a <inear  system  without  the  use  of  strong  damping  functions;  when  the  damping  functions  are  empirical,  the 
results  become  arbitrary  and  devoid  of  any  meaning  associated  with  solutions  of  the  radiative  transfer  equation. 

Thus,  under  the  conditions  imposed  l y this  problem,  a nonlinear  approach  is  mandatory  in  order  to  achieve 
physically  and  mathematically  meaningful  results.  The  relaxation  method  of  solution  applied  in  this  paper  is  non- 
linear and  is  capable  of  discriminating  between  noise  and  true  information,  without  any  data  smoothing. 

The  problem  to  be  solved  by  the  method  of  relaxation  reduces  to  finding  values  of  T(p)  or  q(p)  that  satisfy 
the  equation 


lnp 

7(Pj)  = B(i>h  TsM»,,ps)  i f B[vh  1\p)\  din p (3) 

\nps 

for  all  values  of  Tfsj).  The  solution  will  be  obtained  by  iteration  starting  from  an  arbitrary  initial  guess  (n  = 0).  If 
the  initial  guess  7^n\p)  or  q^{p)  is  the  solution,  the  residuals  R^n\ t>,)  will  be  satisfied  for  all 
frequencies  with 


%i)  - MX*) 

%i) 


e 


(4) 


But  if  Eq.  (4)  is  not  satisfied  we  generate  a new  guess  using  an  appropriate  relaxation  equation. 

To  derive  the  needed  relaxation  equation  we  will  consider  first  the  simple  case  when  r is  weakly 
dependent  on  changes  in  temperature,  which  permits  the  integrand  to  be  treated  as  a function  of  T(p). 
If  r(o^(  p)  does  not  lead  to  zero  residuals  for  all  values  of  Vj , we  proceed  to  satisfy  Eq.  (3)  for  each  frequency 
separately.  We  multiply  the  right-hand  side  of  Eq.  (3)  by  the  ratio  /(«',) to  get 
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and  generate  a new  guess  for  the  Planck  function  from  the  expression  between  brackets  as 


B 


(6) 


Next,  we  pair  each  frequency  v,  with  a specific  pressure  level  p,,  such  that  a frequency  t>j  is  designated  to  sound 
the  atmosphere  at  the  level  p/  from  which  a substantial  portion  of  its  up  welling  thermal  energy  originates,  and 
use  Eq.  (6)  as  a relaxation  equation  to  determine  at  the  J pressure  levels.  Then,  we  use  a suitable 

interpolation  formula,  substitute  the  new  temperature  profile  into  Eq.  (3),  and  repeat  the  relaxation  process  until 
the  residuals  approach  zero. 


Equation  (6)  shows  that  pairing  of  frequencies  and  pressure  levels  is  valid  everywhere  except  for  B = 0. 
Therefore,  when  the  given  function  is  exact  and  continuous  the  resulting  solution  should  be  unique,  if  it  exists. 
However,  when  I(v)  is  given  at  a discrete  number  of  frequencies  we  can  no  longer  obtain  a unique  solution.  In- 
stead, we  seek,  a family  of  solutions  and  optimize  the  results  by  selecting  frequencies  that  accentuate  the  physical 
factors  in  which  we  are  interested.  The  selection  of  a set  of  sounding  frequencies  here  follows  the  principles 
proposed  by  Kaplan  [1959] . 


GENERAL  METHOD  OF  SOLUTION 


Examination  of  Eq.  (6)  reveals  that  the  solution  of  Eq.  (3)  is  obtained  by  pt  . forming,  for  each  frequency 
(/,,  a different  scale  transformation  on  B^n\v(,p)  over  the  whole  range  of  integration,  and  then  assigning  the  new 
value  to  the  preselected  pressure  level  p,-.  This  suggests  an  alternative  method  of  solution  in  which  the  transforma- 
tion is  applied  directly  to  the  unknown  variable,  in  a manner  similar  to  that  used  by  Chahine  [1970]  for  determin- 
ing the  constant  mixing  ratio.  The  relaxation  equation  for  temperature  then  takes  the  form  of 

7 <rt+1)(pf)  = a,-  7<">Cp,)  (7) 

where  each  ctj  is  a constant  scaling  T^n*(p) , to  satisfy  Eq.  (3)  for  each  frequency  separately.  This  general  approach 
is  required  when  the  integrand  is  a functional  of  the  unknown  parameters  as  in  the  case  of  determining  composition 
profiles,  and  the  case  of  determining  temperature  profiles  when  the  transmittance  is  strongly  dependent  on  tempera- 
ture variations  . Obviously,  when  r is  weakly  dependent  on  T,  the  scaling  constants  a(  for  temperature  can  be 
obtained  directly  from  Eq.  (6). 

To  apply  the  general  approach  for  determining  composition  profiles,  let  us  first  integrate  Eq.  (1)  by  parts  and, 
for  convenience,  write  the  results  as 

Ir.p 

r(.»,(q(.p)),.  ■■)•-——  dlnp  (8) 
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where  q(p)  is  the  composition  profile,  and  the  notation  <q(p)>  indicates  that  the  transmittance  is  a functional 
of  q(p),  since  r depe' . Is  on  the  distribution  of  q(p)  between  p and  p. 

To  determine  q(p,  ) from  a given  set  of  radiance  measurements,  assuming  that  the  temperature  profile  is 
known,  we  pair  each  frequency  v;  with  a pressure  level  pj , then  make  an  initial  guess  for  q'n'(pj)  and  solve  the 
radiative  transfer  equation 


Tty)  - B(vj ) + / r(i>,  <a  • q^{p)) , . - . ) d In  p = 0 

J dl  np 

In  Pi 


(9) 


to  obtain  the  scaling  constants  a,  corresponding  to  each  sounding  frequency  v,-  (see  appendix).  We  generate  the 
following  iterations  through  the  relaxation  equation 


q(n+,)(pJ.)  = <Y/(”)(Pj) 


(10) 


The  iteration  process  is  repeated  until  each  value  of  the  scaling  constants  approach  unity,  which  is  equivalent  to 
satisfying  the  residuals  in  Eq.  (4).  This  relaxation  method  of  solution  leads  to  accurate  determination  of  composi- 
tion profiles  without  any  a priori  information  about  the  expected  solution,  and  the  residuals  converge  to  zero  if, 
and  only  if,  q^(p)  is  the  solution. 


The  process  of  optimizing  the  results  obtained  from  a discrete  set  of  sounding  frequencies  should  not  violate 
the  law  of  conservation  of  information,  so  that  from  a set  of  J sounding  frequencies  a maximum  of  J scaling  con- 
stants otj  can  be  determined.  The  use  of  weighted  scaling  constants  of  the  form 

J 

4"  <"> 

^ Wty,P/) 


in  Eqs.  (7)  and  (10)  can  be  useful  in  the  presence  of  iarge  errors  in  measurement.  In  Eq.  (1 1)  is  the  frac- 

tional contribution  from  the  ith  pressure  level  to  the  total  outgoing  radiance  at  frequency  vj.  An  approximation  to 
the  values  of  the  weights  has  been  given  by  Smith  f 1970]  for  temperature 


Mw.Pj) 

W(vjtpi)  = for  P|<Pj 
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and 


W(i >j,Ps)m 


(12) 
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and  for  composition  profiles  as 


W{n)(Vj-Pi) 


l(n-0 


da 


3 In  p 


(13) 


NUMERICAL  EXPERIMENT 


The  general  relaxation  method  has  been 
applied  to  invert  synthetic  radiance  data  generated 
on  a computer  from  two  freqency  ranges  The 
numerical  experiment  is  intended  to  demonstrate 
the  accuracy  and  stability  of  solutions  derived 
without  any  a priori  information  about  the  ex- 
pected profiles. 

The  7.5.u  CH«  Band  of  Jupiter 

The  set  of  seven  sounding  frequencies  shown 
in  figure  1 has  been  selected  by  Taylor  [1971]  to 
sound  the  temperature  profile  above  the  pressure 
level  of  1 atm  (Z  = 36  km)  in  the  Jovian  atmos- 
phere. Assuming  a constant  mixing  ratio  of  methane, 
and  the  model  temperature  profile  shown  in  figure  2, 
a set  of  seven  simulated  values  of  / (u, ) were  gen- 
erated and  used  to  reconstruct  the  model  temperature 
profile  starting  from  three  different  initial  guesses: 

1.  T^lp)  = 100.0 

2.  lK0V)  = 200.0-0.01  (Z-80)2 

3.  r(°)(p)  = 100.U+ 0.01  (Z-80)J 


Figure  1-  Normalized  weighting  functions  dr/d  inp  for  the 
selected  set  of  7 sounding  frequencies  in  the  7.5m  CH,  band 
of  Jupiter.  The  instrumental  slit  function  is  5 cm" 1 for 
frequency  1306  and  20  cm-1  for  the  rest.  | After  Taylor, 
1971). 


The  transmission  function  was  recomputed  during  each  of  the  first  four  iterations,  to  correct  for  the  large  changes  in 
temperature  from  T^°){p)  to  T^(p).  Therefore,  during  the  first  four  iterations  the  integrand  was  treated  as  a 
functional  of  T(p)  and  the  general  relaxation  method  was  applied;  the  corresponding  scaling  factors  a(  were 
evaluated  numerically  by  linear  interpolation.  The  solution  converged  rapidly  and  uniformly  so  that  after  the  fourth 
iteration  the  values  of  ct,  were  derived  directly,  without  the  need  for  interpolation,  from  the  Planck  function  ratio 
of  Eq.  (6),  as 


°r 


bi'ilT 


(«) 

(Pi) 


ln(l-  jl-exp^/r^jl  [/, <*>/£]) 


Figure  3 shows  the  dependence  of  the  final  answer  on  the  initial  guess.  The  span  of  the  thin  horizontal  bar  can  en 
velope  the  whole  family  of  solutions  obtained  from  these  three  widely  varying  initial  guesses. 
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Figure  2.-  Three  different  initial  guesses  used  to  start  the 
iterative  solution  for  the  determination  of  the  model 
temperature  profile  of  Jupitei. 


Figure  3.-  Effect  of  the  initial  guess  on  the  final 
solution  for  the  cases  of  no  errors  and  with  errors 
in  measurements. 


To  study  the  effect  of  measurement  errois  on  the  final  solution  a set  of  simulateu  large  errors  were  introduced 
to  the  set  of  exact  radiance  values.  The  errors  were  given  a random  distribution  cf  signs  and  a maximum  value  of 
9.6%  with  an  rms  value  of  5.42%.  The  corresponding  temperature  profile  was  reconstructed  as  in  the  case  of  zero 
errors.  The  results,  shown  as  heavy  burs  in  figure  3,  clearly  demonstrate  the  weak  dependence  of  the  final  answer 
on  the  initial  guess. 

The  use  of  weighted  scaling  factors 

J 

aj9T(->j,pi)li\np 


^ arOy.PjO/dlnp 


did  not  alter  the  solution.  However,  in  the  case  of  no  errors  in  measurements  the  rate  of  convergence  became  much 
slower;  but  in  the  case  of  large  errors  in  measurement  the  use  of  5^  improved  the  results  and  removed  any  ambiguity 
about  the  terminal  order  of  iteration.  Tigure  4 shows  that  the  solution  for  the  case  of  S.42%  errors  changed  by  less 
than  0.1°  from  iteration  1 1 to  25.  The  iteration  process  should  be  terminated  when  the  residuals  reach  their 
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AS  INDICATED 


Figure  4.-  Varietions  of  the  rms  miduaU  with  iterations  Figure  5.-  Comparison  between  exact  and  reconstructed 

in  comparison  with  the  rate  of  convergence  of  the  profiles  starting  from  an  initial  guess  T^ip)  = 100.0 

temperature  profile,  using  weighted  scaling  factors  and  with  S .24%  rms  errors  in  radiance  measurements, 

the  process  or  relaxation. 

asymptotic  value,  provided  that  quadrature  and  truncation  errors  are  much  smaller  than  errors  in  r and  7,  . Figure  S 
shows  a typical  comparison  between  the  exact  temperature  profile  and  the  reconstructed  temperature  values  with 
linear  interpolation,  for  the  case  of  5.42%  random  errors  in  / with  initial  guess  (1). 

6.3ft  Water  Vapor  Band  in  the  Earth  Atmosphere 

A set  of  sounding  frequencies  shown  in  figure  6 was  selected  for  determining  the  water  vapor  mixing 
ratio  profile  from  simulated  observations  in  the  earth’s  atmosphere.  A set  of  synthetic  radiance  data 
corresponding  to  the  mixing  ratio  profile  shown  in  figure  7 was  generated  for  the  case  of  the  U.S.  Standard 
Atmosphere  (1962)  temperature  profile.  The  computation  of  /((',  ) from  Eq.  (i ) was  carried  out  using  a modified 
Simpson’s  rule  with  a first-order  interpolation  formula  on  the  scale  of  figure  7;  the  relative  error  caused  by  numerical 
quadrature  is  less  than  0.47%. 

These  simulated  radiance  values  were  used  to  reconstruct  the  comp'  sition  profile  starting  with  an  ’ial  guess 
q(°)(p)  = O.ig/kg.  Because  the  transmittance  is  a functional  of  q(p) , *r  values  of  the  scaling  constants  a(-  were 
determined  by  interpolation  throughout  the  iteration  process. 

The  results  illustrated  in  figure  7 show  that  the  relaxation  method  i?  capable  of  reconstructing  the  composition 
profiles  from  radiance  observations  alone  without  any  a priori  information  about  the  solution.  The  selection  of 
different  sets  of  sounding  frequencies  from  the  same  spectral  range,  and  the  use  of  water  vapor  mixing  ratio  q^(p)  - 
p,  p1 , or  p3  as  initial  guesses  di<<  not  change  the  accuracy  of  the  results.  The  solid  bar  is  large  enough  to  envelop  the 
whole  family  of  reconstructed  solutions. 
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Figun  6.-  Normalized  weighting  functions  dr/dlnp  for  the 
selected  set  of  9 sounding  frequencies  computed  for  an 
instrumental  slit  function  of  S cm-1 . 
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Figure  8.-  Reconstructed  mixing  ratio  profile  from  radiance 
measurements  with  2.24%  rms  random  errors. 
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Figure  7.-  Rccc/.sUuctai  mixing  ratio  proFile  from 
exact  synthetic  radiance  measurements. 


Figure  9.  - Rate  of  convergence  of  solutions  u 
judged  by  the  variations  of  residuals  with  number  of 
iterations,  using  weighted  scaling  factors  in  the  process 
of  relaxation.  The  order  of  iteration  should  be  terminated 
when  the  residuals  approach  their  asymptote,  which 
corresponds  to  a cutoff  at  or  after  iteration  24  and  10 
for  z era  and  2.26%  errors,  respectively. 
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The  effect  on  the  reconstructed  profiles  of  random  errors  in  radiance  measurements  is  shown  in  figure  8.  This 
effect  is  more  pronounced  in  the  lower  atmosphere;  however,  it  is  a characteristic  of  the  spectral  range  used  in  this 
illustration  and  should  not  necessarily  be  true  in  other  frequency  ranges. 

The  rate  of  convergence  is  determined  from  the  behavior  of  the  residuals.  Figure  9 shows  that  </?^>rms 
decreases  rapidly  and  then  approaches  an  asymptotic  value  approximately  equal  to  the  sum  of  the  averaged  random 
and  numerical  errors.  The  iteration  process  should  be  terminated  when  the  rms  residuals  approach  this  asymptote. 

Application  of  the  weighted  scaling  constant  of  Eq.  (1 1)  reduced  the  efficiency  of  the  relaxation  process  for 
the  case  of  zero  errors,  but  improved  the  stability  of  the  solution  in  the  presence  of  large  random  errors  in  measure- 
ment, and  removed  any  ambiguity  about  the  number  of  needed  iterations.  In  the  case  of  the  upper  curve  in  figure  6 
the  optimum  profile  can  be  obtained  anywhere  from  iteration  10  to  iteration  35  with  the  same  degree  of  accuracy. 


CONCLUSION 


The  efficiency  of  any  fixed  set  of  sounding  frequencies  can  be  strongly  dependent  on  atmospheric  conditions, 
particularly  in  the  presence  of  a variable  composition  mixing  ratio.  An  improvement  in  the  level  of  efficiency  can  be 
obtained  by  the  use  of  weighted  scaling  factors,  in  the  presence  of  large  errors  in  measurement. 

The  relaxation  method  can  be  applied  in  conjunction  with  any  interpolation  formula.  The  extent  of  interpola- 
tion is  dictated  by  the  quadrature  requirements,  and  by  the  need  to  optimize  the  quality  of  solutions  obtained  from 
a given  set  of  sounding  frequencies. 

Finally,  the  method  of  relaxation,  which  was  applied  in  this  paper  to  study  homogeneous  nonscattering  atmos- 
pheres, can  be  applied  also  to  the  study  nonhontogenous  scattering  atmospheres  without  any  loss  of  generality. 
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APPENDIX 

INTERPOLATION  PROCESS  FOR  DETERMINING  o, 


The  determination  of  a j from  Eq.  (9)  can  be  accomplished  with  almost  any  standard  method  for  solving  non- 
linear equations. 

Let 


In  p 

,»-/ 

InPj 


r(*7 , <atf  qip)) , . . . ) d lnp 
1 d lnp 


and  let 


'O', •.<*,)  = /(>',)  - /(»,•, a,) -*•€ 

where  e is  the  magnitude  of  numerical  quadrature  errors.  We  aim  to  determine  a,-  by  interpolation  beginning  with 

a{.(‘)  * 1.0 

We  compute  r/1)  3 , a^1))  and  use  as  a second  interpolation  point,  say,  the  value  of 


7 [*'/•“« 


.(») 


we  compute  r^2  ) and  obtain  A:  = 3,  by  linear  interpolation  as 


a .(*-  2)  - r.(*~2) 

“i  r» 


a/*-2)  - ^(*-0 
rX*- 2)  - 


> 0 


The  next  value  is  obtained  by  linear  interpolation  between  and  the  smallest  of  all  other  |ry|  values. 

The  root  a,-  corresponds  to  the  minimum  of  all  . 

A check  for  the  possible  existence  of  multiple  roots  is  advisable,  particularly  in  the  presence  of  strong  inversion 
in  the  temperature  profile.  However,  if  the  wrong  root  is  selected,  the  residuals  of  Eq.  (4)  wi'l  diverge. 
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ABSTRACT 


The  features  of  the  limb  inversion  problem  that  differentiate  it  from  the  nadir  problem  ar  j described;  the  most 
important  of  these  are  the  sharply  peaked  weighting  functions  and  the  nonlinear  kernel  that  arises  because  the  atmos- 
phere is  required  to  be  in  hydrostatic  balance.  The  problem  may  be  split  into  two  parts-given  the  piessure  at  one 
level  and  the  measured  radiances,  can  the  temperatures  be  retrieved?  And  how  can  the  pressure  be  obtained  at  one 
level?  The  second  problem  is  shown  to  depend  on  the  first,  which  is  solved  iteratively.  The  convergence  properties 
and  final  accuracies  are  presented  Inversion  to  obtain  trace  constituents  is  briefly  discussed. 


INTRODUCTION 


Previous  writers  have  described  methods  for  inverting  measurements  of  tt-e  upwelling  radiance  in  several  narrow 
spectral  intervals.  The  radiometer  in  those  cases  is  pointed  close  to  the  nadi;.  Limb  radiance  measurements  are  made 
while  a radiometer  is  scanned  from  the  planet  to  space  across  the  planet  limb.  To  the  extent  that  thermal  radia- 
tion emitted  by  the  planetary  atmosphere  is  passively  received  by  a radiometer,  there  are  similarities  between  the 
limb-  ar.d  nadir-viewing  problems.  However,  the  change  in  geometry  changes  the  inversion  problem,  most  impor- 
tantly by  maxing  the  weighting  functions  or  kernels  of  the  integral  equation  strongly  dependent  on  the  temperature 
profile.  Here  we  describe  the  results  of  applying  a version  of  Chahine's  iterative  algorithm  to  this  nonlinear  problem 
and  note  some  of  the  technique’s  convergence  properties. 

Techniques  for  inverting  limb  measurements  are  not  as  developed  as  those  for  nadir  observations,  and  experi- 
ence much  more  limited.  Although  the  first  satellite  inversion  scheme  [King,  19S6]  proposed  an  angular  scan  as  a 
means  of  probing  the  entire  atmosphere,  Kaplan ’s  [19S9]  suggestion  for  a frequency  scan  at  a given  viewing  angle 
has  been  exploited  for  practical  reasons.  Schemes  for  inverting  measurements  of  limb  radiance  have  been  put  for- 
ward since  the  mid-1960s,  but  only  in  the  last  year  has  an  algorithm  been  developed  to  use  data  that  are  easily 
measured  with  state-of-ine-art  technology.  The  work  reported  here  should  therefore  be  regarded  as  an  initial 
investigation. 


THE  INVERSION  PROBLEM 


The  geometry  and  associated  rr  nenclature  of  limb  viewing  are  shown  in  figure  1 . The  radiometer  receives 
radiation  emitted  along  a particular  i path,  which  may  be  identified  by  the  tangent  height  (h),  the  altitude  of  the 
lowest  point  (tangent  point)  above  the  surface.  The  atmosphere  is  sounded  by  receiving  radiation  from  difierent 
tangent  heights,  from  h<  0 (ray  paths  intersecting  the  surface)  to  h ~ 100  km,  from  which  a very  small  signal  is 
received. 


* Present  affiliation:  National  Center  for  Atmospheric  Research,  Boulder,  Colorado. 
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Figure  1.-  The  geometry  of  limb  viewing,  indicating  nomenclature.  A profile  is  obtained  bv  measuring  outgoing  radiance  as 

a function  o(  tangent  height. 

Some  features  of  the  inversion  problem  are  immediately  apparent  from  figure  1.  A relatively  minor  point  is 
that  all  the  signal  originates  in  the  atmosphere;  the  boundary  term  in  the  equation  of  radiative  transfer  vanishes, 
since  the  far  side  of  the  atmosphere  is  cold  space.  Although  this  is  not  too  important  for  temperature  determination, 
it  is  an  enormous  aid  in  the  determination  of  minor  atmospheric  constituents,  such  as  ozone  and  water  vapor. 

We  also  note  that  there  is  much  more  emitting  material  along  a horizontal  path  than  on  a vertical  path  through 
the  atmosphere.  This  indicates  that  we  may  get  a measurable  signal  from  high  levels,  but  that  the  large  opaciiy  at 
low  levels  will  not  allow  signals  from  the  tangent  point  to  reach  the  radiometer.  The  problem  is  compounded  by 
clouds.  For  these  reasons,  limb  scanning  is  not  a good  way  to  probe  below  the  upper  troposphere.  Although  lower 
regions  can  be  sounded,  subsequent  results  are  presented  only  above  IS  km. 

More  important  is  the  ink  .ent  high  vertical  resolution.  From  the  geometry,  it  is  clear  that  none  of  the  signal 
originates  below  the  tangent  point.  If  the  atmosphere  above  the  tangent  point  is  divided  into  a series  of  thin  shells 
of  equal  thickness,  the  longest  horizontal  segment  of  the  ray  will  lie  in  the  shell  immediately  above  the  tangent  point. 
Because  density  fails  off  exponentially  with  height,  this  shell  will  also  contain  a large  fraction  of  the  mass  along  the 
path.  For  a wide  range  of  tangent  heights,  the  largest  part  of  the  emitted  radiation  will  originate  in  this  shell.  Thus, 
if  we  write  an  expression  for  the  outgoing  radiation  / when  viewing  tangent  height  h as 
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Figure  2.-  Limb  viewing  weighting  functions  W<h ; r')  (defined 
in  Eq.  (1))  for  the  ideal  case  of  an  instrument  with  an  infi- 
nitesimal vertical  field  of  view.  The  strongly  peaked  shape 
is  an  important  difference  from  the  broader  nadir-viewing 
weighting  functions. 


in  which  subscript  i denotes  the  spectral  interval,  B 
the  Planck  intensity,  T the  temperature,  and  z the 
altitude,  then  z')  is  a weighting  function,  indi- 
cating how  much  level  contributes  to  the  intensity 
received  when  viewing  h. 

Weighting  functions  for  several  tangent  heights 
through  a midlatitude  atmosphere  are  shown  in  figure 
2 for  a broad  band  covering  most  of  the  IS  pm  band 
of  C02  with  an  infinitesimal  vertical  field  of  view.  For 
tangent  heights  above  25  km  a large  part  of  the  contri- 
bution comes  from  within  about  3 km  of  the  tangent 
height.  These  should  be  compared  to  the  much  broader 
nadir  weighting  functions  discussed  in  the  preceding 
sections  of  this  chapter.  Below  25  km,  the  weighting 
functions  begin  to  look  like  the  broad  nadir-viewing 
weighting  functions,  although  there  is  still  a spike  at 
the  tangent  point  due  to  the  less  strongly  absorbing 
wings  of  the  band. 

The  narrow  weighting  functions  give  an  indica- 
tion of  the  potential  vertical  resolution.  A real  instru- 
ment will  have  a finite  field  of  view,  and  its  weighting 
functions,  convolutions  cf  ones  like  those  in  figure  2, 
will  be  broader.  (This  immediately  indicates  that  if  one 
is  to  use  the  possible  inherent  vertical  resolution 
the  vertical  instrument  field  of  view  at  the  horizon 
must  be  less  than  about  6 to  8 km  to  improve  on  nadir 
viewing.)  However,  spectral  resolution  is  not  necessary, 
and  a broad  spectrai  interval  may  be  used. 


The  most  important  difference  comes  about  because  of  the  sensitivity  of  the  weighting  functions  to  the  amount 
of  emitting  material,  especially  at  the  tangent  point,  and  the  transmission  along  the  path.  For  temperature  determina- 
tions we  shall  measure  the  radiation  emitted  by  a uniformly  mixed  constituent,  carbon  dioxide.  This  means  that  the 
weighting  functions  will  be  sensitive  to  the  atmospheric  density  profile,  which  in  turn  depends  on  the  temperature 
profile  through  the  hydrostatic  equation.  Simply  stated,  higher  temperatures  in  the  lower  part  of  the  atmosphere 
move  density  surfaces  (or,  approximately,  surfaces  of  constant  optical  depth)  upward.  Because  of  the  sharp  weight- 
ing functions,  even  a shift  of  a given  density  surface  by  1 km  can  appreciably  modify  the  weighting  functions.  Thus, 
instead  of  Eq.  (1)  we  write 


/,(*)  = fsiinz)  1 Wfinzy.h-.z]  dz 


(2) 


To  develop  die  specific  expression  to  be  inverted,  we  note  from  the  geometry  of  figure  1 that  the  outgoing 
radiance  from  a nonscattering  atmosphere  in  local  thermodynamic  equilibrium  may  be  written 
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where  r is  the  transmission  between  x and  the  spacecraft  (<*)  along  the  path  through  h,  x is  the  distance  co- 
ordinate along  the  ray  path  with  oigin  at  the  tangent  point,  and  the  other  symbols  are  as  above. 

For  a finite  band  in  the  spectrum 


r(h;x ) = r(h;a,p ) 


(4) 


in  which  a,  p are  the  temperature  weighted  absorber  amount  and  temperature-absorber  weighted  pressure,  respec- 
tively [Goody,  1964;  Rodgers  and  Watshaw,  1966]. 

Incorporating  Eq.  (4)  into  Eq.  (3),  converting  to  a vertical  integral,  using  the  hydrostatic  law  and  perfect  gas 
iaw.  and  specializing  to  a uniformly  mixed  constituent,  ws  obtain  [Gille  and  House,  1971  ] . 
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where  p0  is  the  pressure  at  an  arbitrary  level  z(y. 

C is  a constant,  and  a(z'),  b(z')  contain  constants 
and  the  information  on  the  temperature  variation 
of  the  transmission.  Subscripts  a and  p denote 
the  z anterior  and  posterior  to  the  tangent  point. 
The  exponential  term  relates  the  density  at  z to 
that  at  zQ  through  the  hydrostatic  equation. 
Although  not  written  explicity,  the  transmission 
derivatives  depend  on  density  in  the  ray  paths,  and 
consequently  on  the  hydrostatic  equation  as  well. 

Equatior 'S)  makes  use  of  the  fact  that  all 
other  altitudes  may  be  measured  relative  to  zQ. 
This  is  extremely  important,  because  it  is  instru- 
mentally  tar  easier  to  determine  relative  altitudes 
than  absolute  altitudes.  Thus,  given  a measured 
radiance  profile  like  the  broad  channel  measure- 
ment in  figure  3,  but  with  no  attached  altitude 
scale,  we  may  pick  a particular  point  on  the  curve 
such  as  A,  designate  its  height  as  the  reference 
height  z0,  and  measure  all  other  heights  relative 
to  it.  Hencefoith,  for  convenience  we  designate 
relative  tar.gent  heights  and  altitude  by  asterisks: 
h -z0  - h*,z  -z0  - z*. 

Equation  (S)  defines  the  inversion  problem. 
We  have  measurements  of  l(h *),  and  wish  to 


Figure  3.-  Limb  radiance  as  a function  of  tangent  height  for 
wide  (585-705  cm-1 ) and  nanow  (630-685  cm-1 ) channels 
in  the  15  um  band  of  carbon  dioxide.  Their  difference 
results  from  emission  by  generally  weaker  lines  away  from 
the  band  center,  and  provides  information  on  deeper  atmos- 
pheric layers.  The  arrow  (A)  indicates  the  arbitrary  level 
whose  altitude  is  designated  z0  where  the  pressure  p0  is 
required.  (See  Description  and  Results  of  an  Iterative 
Inversion  Algorithm.) 


determine  T(z*).  Qearly  p0  is  a critical  parameter.  The  inversion  problem  may  then  be  separated 
into  two  parts: 

1.  Given  pQ  and  measurements  of  I(h*),  how  may  !T(r*)  be  determined? 

2.  How  may  pQ  be  determined? 

The  first  question  is  treated  below.  The  second,  closely  related  to  the  first,  is  discussed  in  a later  section. 

Description  and  Results  of  an  Iterative  Inversion  Algorithm 

The  complex  nonlinear  manner  in  which  the  temperature  enters  on  the  right-hand  side  of  Eq.  (5)  suggests  that 
some  type  of  iterative  method  will  be  required.  Basically,  the  purpose  is  to  bring  about  temperature  fields  that  satisfy 
both  the  radiative  transfer  and  the  hydrostatic  equations.  The  sharply  praked  weighting  functions  indicate  that  dis- 
crepancies between  observation  and  calculation  at  a particular  tangent  height  may  be  used  to  correct  the  temperature 
at  that  elevation,  as  in  Outline's  (1968, 1970]  method. 

The  steps  in  such  a solution  are  as  follows: 

1 . In  the  first  iteration,  an  initial  temperature  profile  is  assumed. 

2.  The  temperature  profile  is  used  with  po  and  the  hydrostatic  equation  to  distribute  the  atmospheric  mass 
in  altitude.  The  importance  of  this  step  is  the  major  difference  from  a nadir-viewing  algorithm. 

3.  The  outgoing  radiation  is  calculated  as  a function  of  tangent  height  from  one  of  the  equivalent  Eqs.  (2), 
(3),  and  (S). 

4.  The  calculated  outgoing  radiances  are  compared  to  the  observed  values,  tangent  height  by  tangent  height. 
If  a convergence  criterion  is  satisfied,  the  temperature  profile  is  accepted  as  the  solution.  A typical  con- 
vergence criterion  is  that  the  rms  difference  between  the  calculated  and  observed  radiances  over  the  profile 
(the  radiance  residuals),  be  slightly  less  than  the  noise  of  the  radiometric  measurements. 

5.  If  the  convergence  criterion  is  not  met,  the  discrepancies  between  the  calculated  and  observed  radiances 
at  each  tangent  height  are  used  to  adjust  the  temperature  there. 

Making  the  assumption  that  the  discrepancy  is  due  to  errors  in  Planck  radiance  alone,  a corrector  or  relaxation 
equation  is  [GilU  and  House,  1971]. 


7 "+l  (i h •)  = r"(h*)  + 


2Tn(h*)3  I(h*)-Icn(h*) 
CjU  /(*•)  + Ic\h*) 
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in  which  superscripts  denote  the  order  of  iteration,  is  the  second  radiation  constant,  v is  a mean  frequency,  / 
is  the  observed  radiance,  and  Ic  is  the  computed  radiance  value.  The  improved  temperature  is  then  used  in  step  2, 
and  the  procedure  iterated  until  the  convergence  criterion  is  met. 

Results  of  applying  this  procedure  to  data  synthesized  for  a broad  spectral  interval  covering  much  of  the  15  pm 
band  of  COj  for  an  actual  midlatitude  winter  sounding  are  shown  in  figure  4,  in  which  the  solid  line  is  the  original 
sounding.  The  dotted  line  is  an  inversion  of  ideal  data-that  is,  noise  free  data  having  an  infinitesimal  field  of  view 
at  the  horizon  The  inverted  points  were  spaced  1 km  apart  vertically.  The  rms  radiance  residuals  were  reduced  only 
to  0.01  Wnf*  sr"‘ , which  mi  taken  ai  a rough  instrumental  noise  figure.  The  rms  temperature  error  is  0.91°K,  and 
the  solution  reproduces  die  initial  profile  very  well. 

Note  that  the  thin  warm  anomaly  at  37  km  is  retrieved  quite  accurately.  Smaller  variations  below 
25  km  are  not  followed  as  well. 
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The  dashed  line  indicates  the  results  obtained  when  data  for  the  same  atmosphere,  but  now  including  the 
effects  of  a finite  field  of  view  and  random  noise  of  0.01  Wm-1  sr-1 . Now  the  rms  temperature  error  is  4.2°K  from 
IS  to  60  km,  and  2.43°K  from  15  to  54  kr.i.  In  this  case,  the  thin  features  are  not  retrieved  very  well  (there  is  just  a 
hint  of  the  3?  km  anomaly),  but  the  overall  curve  is  followed  quite  closely.  Large  unreal  oscillations  occur  at  the  top 
of  the  sounding. 


The  spacing  of  the  inverted  points  is  not  fixed.  We  may  space  them  3 km  apart  instead  of  ! km,  in  which 
case  the  results  are  similar  to  those  for  1 km  spacing,  except  that  the  large  oscillation  at  the  top  of  the  sounding  do 
not  appear. 


CONVERGENCE  PROPERTIES 

It  is  < ./  instructive  to  follow  the  convergence  to  a solution.  Figure  5 displays  the  reduction  of  the  radiance 
residuals  as  a function  of  iteration  number  for  the  two  cases  discussed  above.  Also  shown  are  radiance  residuals 
when  the  inverted  points  are  spaced  3 km  apart,  and  intermediate  points  are  obtained  by  linear  interpolation.  The 
3 Vm  calcualtions  require  much  less  computer  time. 


This  is  a very  instructive  plot.  Note  first  that  for  noisy 
or  ideal  data,  1 or  3 km  spacing,  the  residuals  are  nearly 
identical  for  the  first  nine  iterations.  Beyond  that  point, 
the  residuals  of  the  ideal  data  continue  to  drop  rapidly, 
while  the  noisy  data  residuals  le  /el  off.  The  operational 
significance  is  important:  the  first  stages  of  an  inversion 
may  be  done  rapidly  with  widely  spaced  points,  and  later 
stages  may  be  done  with  higher  resolution. 


TEMPERATURE  (K) 

Figure  4.-  Solutions  obtained  by  inverting  synthesized  ideal 


data  (no  noise,  infinite  vertical  resolution),  and  realistic 
data  (0.01  Wm~2  sr-1  random  noise,  2 km  field  of 
view),  compared  to  an  actual  midlatitude  winter 
sounding. 


Figure  5.-  Root-muan-square  radiance  residuals  anrf  temperature 
errors  as  functions  of  iteration  number,  for  the  midlatitude 
winter  profile  for  ideal  and  realistic  data,  1 and  3 km  spac- 
ing of  the  inverted  values. 
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The  second  notable  point  is  that  for  the  first  several  iterations  there  is  an  oscillatory  behavior  with  small  resi- 
dual increases  being  followed  by  large  decreases.  This  reflects  the  up  and  down  sloshing  of  the  atmosphere,  caused 
by  application  of  the  hydrostatic  equation  at  each  step. 

The  behavior  of  the  rms  temperature  errors  also  is  shown  in  figure  S.  For  ideal  1 km  data,  the  temperature 
error  drops  monotonically,  reaching  0.3°K  after  17  iterations.  For  the  ideal  3 km  data,  it  drops  for  four  iterations, 
reaches  a minimum  of  1.2°K  at  the  sixth  iteration,  and  then  slowly  rises  to  1.7°K.  At  the  eighth  iteration,  when  the 
residuals  have  dropped  below  the  noise  (“convergence”),  the  error  is  1.35°K.  Clearly,  in  this  case  the  convergence 
criterion  is  letting  the  solution  go  past  the  optimum  point  After  the  fourth  iteration,  the  1 km  inversion  is  always 
better  than  the  3 km  inversion. 

For  data  containing  0.01  W/m2  sr  noise,  the  rms  temperature  errors  for  both  1 and  3 km  inversions  pass  through 
minima  before  the  radiance  residuals  reach  the  noise  level.  In  this  particular  case,  the  3 km  temperature  error  reaches 
a minimum  value  of  1.9°K  at  the  fouith  iteration,  and  only  l.SS°K  from  15  to  54  km.  At  the  eighth  iteration,  when 
the  radiance  residuals  have  dropped  below  the  noise,  the  temperature  -rrors  are  2.1°K  for  both  15  to  54  and  15  to 
60  km.  Although  further  iteration  reduces  the  rms  radiance  errors  from  8.8X  10“3  to  5.6X  10-3 , the  rms  temperature 
error  increases  to  2.5°K,  or  2.4°K  from  15  to  54  km. 

For  noisy  data,  the  results  of  1 km  spacing  of  inverted  points  is  worse.  Now  the  15  to  60  krn  error  is  3.2°K  ai 
the  fourth  iteration,  rising  to  4.2°K  at  the  eighth,  and  increasing  to  about  5.5°K  with  further  iteration.  If  we  restrict 
our  attention  to  the  rms  temperature  errors  from  1 5 to  54  km,  we  again  find  a minimum  of  2.2°K  at  the  fourth  itera- 
tion, increasing  to  2.5°K  at  the  eighth,  and  2.8°K  at 
iteration  25.  As  could  be  seen  in  figure  4,  with  1 km 
spacing  a great  deal  of  the  error  is  accumulated  at  the 
top  of  the  sounding;  the  rms  error  is  much  improved 
by  ignoring  this  region.  Comparison  of  the  inversion 
data  and  the  15  to  54  km  data  with  1 km  spacing 
indicates  that  we  have  lower  accuracy  in  the  latter 
case,  although  higher  resolution.  This  is  analogous  to 
the  use  of  larger  or  smaller  value®  nf  y in  Twomey ’s 
[1963]  method  (see  also  earlier  se  . ons  in  this  chapter 
by  Mateer  and  by  Conrath  and  Revan)  or  points  on  a 
Backus-Gilbert  type  curve,  which  depicts  the  tradeoff 
of  accuracy  versus  resolution  explicity  (see  the  dis- 
cussion by  Parker  in  this  chapter). 

For  any  iterative  scheme,  and  especially  for  a 
strongly  nonlinear  problem,  we  expect  the  rate  of 
convergence  and  perhaps  the  final  accuracy  to  depend 
on  the  initial  guess.  The  results  of  inverting  a tropical 
profile  starting  from  the  midlatitude  standard  atmos- 
phere and  from  the  tropical  standard  atmosphere  are 
shown  in  figure  6.  Note  that  for  the  standard  atmos- 
phere start,  18  iterations  are  necessary  for  noise-free 
data  and  21  for  noisy  data.  The  tropical  start  residuals 
drop  much  more  rapidly,  reaching  the  noise  level  in 
9 and  1 1 iterations. 

This  suggests  that  operationally  one  might  try 
several  initial  profiles  to  discover  the  one  with  the 
lowest  residuals,  or  the  most  rapid  drop  in  residuals 
after  one  iteration,  to  get  the  best  starting  profile. 


Figure  6.-  Root- mean-square  radiance  residuals  and  temperature 
errors  as  functions  of  iteration  number  for  a tropical  atmos- 
phere. Thi  effects  of  starting  from  a midlatitude  standard 
atmosphere  or  the  tropical  standard  atmosphere  are  shown  with 
ideal  and  realistic  data,  with  inverted  levels  spaced  every 
3 km. 
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Except  for  the  noise-free  data  on  the  standard 
atmosphere  start,  all  the  rms  temperature  errors  go 
through  a minimum,  although  the  standard  atmosphere 
minimum  is  very  shallow.  Error  at  “convergence''  for 
the  noisy  data  is  2.65°K  from  15  to  60  km  for  the 
standard  atmosphere  start  and  2 35°K  for  the  tropical 
start,  up  from  2.1°K  at  the  fourth  iteration. 

Consideration  of  the  retrieved  temperature  pro- 
files (not  shown)  indicates  that  with  the  size  of  the 
instrumental  noise  used,  there  is  difficulty  in  retrieving 
the  position  of  the  tropical  tropopause  (approximately 
18  km)  and  the  temperature  distribution  below  it. 
Thus,  errors  at  the  top  and  bottom  of  the  1 5 to  60  km 
region  are  larger  than  in  the  intermediate  region.  ar  of 
course  we  might  expect  from  the  weighting  functions. 

DETERMINATION  OF  p0 


TEMPERATURE  T (K) 


. , . , , , , Figure  7.-  The  effect  of  the  value  of  pQ  on  ihe  inferred  profile. 

In  the  study  so  far.  we  have  assumed  that  The  dashed  tine  is  the  true  profile,  the  U.S.  Standard  Atmos- 

Pu  is  known.  Now  we  shall  show  how  this  problem  may  phere.  Solid  lines  are  inversion  solutions  with  the  values  10, 

solved,  if  the  preceding  one  is.  Figure  7 shows  the  effects  12,  and  14  mb  assumed  for  pQ.  The  correct  value  is  12  mb 
of  inferring  a temperatu.e  profile  (in  this  case  the  U.S. 

Standard  Atmosphere)  for  three  different  values  of  pa.  There  is  no  way  to  tell,  on  meteorological  or  mathematical 
grounds,  which  is  the  correct  temperature  profile.  A systematic  variation  can  be  seen,  however.  If  the  pressure  is 
high,  low  temperatures  are  obtained  at  high  levels  and  high  temperatures  at  low  levels.  In  between  is  a region  in 
which  the  temperature  is  correct,  irrespective  of  errors  in  p0. 


The  physical  reasons  for  this  behavior  are  easily  understood  [Gille  and  House,  1971  ] . Too  high  a pressure 
implies  higher  than  correct  densities  everywhere,  which  increases  the  amount  of  emitting  material  and  decreases 
transmittance.  At  upper  levels,  at  which  the  optical  depth  to  the  tangent  point  d is  less  than  1 , the  increased  emitting 
material  dominates  the  decreased  transmittance,  requiring  a lower  than  correct  temperature  for  agreement  with  the 
observed  radiance. 


At  levels  where  d > 1,  the  decreased  transmittance  dominates  and  requires  a higher  than  correct  temperature 
for  agreement.  The  converse  occurs  for  a pressure  too  low.  At  the  level  at  which  d = 1 , the  retrieved  temperature 
is  independent  of  pu. 

This  leads  immediately  to  a method  for  determining  p0  (fig.  8).  T*  :o  channels,  with  different  spectral  opacity 
and  herefore  with  d = 1 at  different  levels,  are  employed.  Each  disp’a/s  *l,e  behavior  shown  in  figure  7,  and  in 
generr.'  the  temperatures  inferred  in  the  two  channels  will  not  agree.  If  the  pressure  is  too  high,  the  opaque  tempera- 
ture will  be  higher  than  the  transparent  channel,  and  vice  versa.  Numerical  studies  [ Gille  and  House,  1971  ] have 
shown  that  this  temperature  difference  between  the  two  channels  is  linear  in  pressure,  with  a zero  crossing  very  close 
to  p0.  h consideration  of  all  error  sources  suggests  that  p0  may  be  determined  to  ±0.3  mb,  with  ±0.17  mb  probably 
possible.  Thus,  successful  determination  of  pc  resu  on  accurate  and  rapid  inversion  of  spectral  data  according  to  a 
scheme  like  the  one  outlined  earlier,  but  for  two  spectral  channels. 


DETERMINATION  OF  TRACE  CONSTITUENT  AMOUNTS 


If  the  temperature  profile  is  known,  measurements  in 
suitable  spectral  channels  may  be  inverted  to  obtain  concen- 
trations of  variable  constituents  such  as  ozone  and  water 
vapor.  Equation  (1)  is  now  the  basic  equation  to  invert,  and 
the  constituents  are  determined  from  their  effects  on  the 
weighting  function.  This  is  a much  more  nearly  linear  prob- 
lem, and  one  of  the  methods  described  earlier  might  be  used. 
McKee  etal.,  [1969]  and  Russell  and  Drayton  [1972]  have 
employed  an  “onion-peeling”  method,  working  down  from 
the  top  of  the  tmosphere  to  determine  water  vapor  and 
ozone  profiles  from  real  and  synthesized  data.  This  is 
equivalent  to  solving  a triangular  matrix.  Home  and  Gille 
(in  preparation)  have  employed  an  iterative  method  similar 
to  the  one  described  for  temperature.  Now  the  relaxation 
equation  is 


cnH(h*)  = cnQt*) 
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Figure  8.-  Schematic  diagram  of  the  two-channel  method  for 
determining  p0.  Only  the  correct  value  remits  in  a zero 
mean  temperature  difference  between  inversions  of  radiance 
data  from  the  two  channels. 


in  which  c is  the  mixing  ratio.  Usually  no  more  than  two  iterations  are  necessary  to  obtain  agreement  within 
experimental  error.  Three  examples  of  retrievals  of  realistic  data  at  3 km  intervals  to  get  stratospheric  water  vapor 
are  shown  in  figure  9. 


CONCLUSIONS 


The  results  demonstrate  that  an  iterative  procedure  of  the  type  proposed  by  Chahine  [1968]  will  enable 
temperature  solutions  to  be  retrieved  from  measurements  of  limb  radiance  in  the  C02  band,  although  the  kernel  is  a 


strong,  nonlinear  function  of  the  temperature.  The  algorithm 
also  requires  that  the  pressure  at  some  level  be  specified;  by 
starting  with  an  initial  guessed  pressure  and  applying  the 
algorithm  to  measurements  in  two  spectral  chcnM:..  thh 
pressure  may  be  accurately  determined,  and  a self-contained 
inversion  achieved.  The  inversion  of  trace  constituent  pro- 
files is  a nearly  linear  problem,  and  needs  no  special 
treatment. 
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Figure  9.  - Three  imrerrions  of  synthesized  realistic  data  to 
obtain  distributions  of  water  vapor  in  the  stratosphere. 
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DISCUSSiON 


King:  You  indicated  that  noise  is  more  of  a limit  to  what  you  can  infer  or  that  noise  prevents  you  from  taking 

advantage  of  the  edge  on  helping  of  your  weighting  function.  Is  that  right? 

Gille:  Noise  and  the  finite  field  of  view  are  both  factors,  but  noise  at  this  level  is  mor~  important. 
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2.  ACTIVE  ATMOSPHERIC  SOUNDING 


In  contrast  to  the  passive  (radiance)  measurement  techniques  discussed  in  Chapter  1 , this  chapter  is  concerned 
with  measurements  of  reflected  and/or  scattered  radiation  from  the  atmosphere  arising  from  man-made  probing 
sources- that  is,  active  radar  experiments.  Both  monostatic  and  bistatic  configurations  with  pulse  and  continuous- 
wave  techniques  are  in  use,  as  well  as  acoustic  and  electromagnetic  (radio-frequency  and  optical  (lidar))  radars. 

The  data  are  inverted  to  yield  vertical  profiles  of  wind  speed  and  direction,  turbulence,  precipitation, 
refractive  index  structure,  particulate  concentrations,  and  gaseous  molecule  concentrations.  Active  remote  sounding 
is  a rapidly  growing  research  field,  and  great  technological  advances  leading  to  sophisticated  sounding  techniques 
have  been  made  in  recent  years;  however,  because  this  is  a relatively  new  field,  the  inversion  algorithms  that  have 
been  developed  specifically  for  this  discipline  are  still  rather  primitive.  It  is  clear  that  scientists  active  in  this  area 
should  benefit  enormously  from  algorithms  developed  and  described  elsewhere  in  this  volume. 

A.  M.  Peterson  organized  and  chaired  the  session  devoted  to  active  atmospheric  sounding. 
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Acoustic  radar  sounding  techniques  were  used  tc  measure  the  wind  velocity  and  direction  in  the  first  300  m of 
the  atmosphere.  Angle-of-arrival  and  Doppler  techniques  were  developed  to  obtain  two  independent  measurements 
of  the  wind  field.  These  techniques  and  preliminary  experimental  results  are  described  briefly. 


INTRODUCTION 


The  potential  of  the  acoustic  radar  sounding  technique  for  the  remote  measurement  of  the  structure  of  the 
lower  atmosphere  (wind  velocity,  temperature,  and  so  forth),  has  been  recognized  for  some  time  [McAllister,  1968, 
McAllister  et  al„  1969;  McAllister  and  Pollam,  1969;  Little,  1969;  Beran,  1970],  This  technique  c insists  of 
transmitting  short  pulses  (40-200  msec)  of  acoustic  energy  at  a carrier  frequency  of  800  Hz  at  a power  level  of  100  W 
from  an  acoustic  array  on  the  ground,  and  recording  the  eneigy  backscattered  from  turbulent  fluctuations  of  tempera- 
ture and  of  wind  velocity  on  a facsimile  recorder,  to  give  a height  versus  time  of  day  display  of  the  structure  of 
turbulence  in  the  first  few  kilometers  of  the  atmosphere.  It  has  been  shown  [McAllister,  1968;  McAllister,  et  al. 

1969;  McAllister  and  Pollard,  1969]  that  thermal  plumes,  radiation  inversions,  and  sea  breeze  circulations  can  be 
studied  with  the  aid  of  the  acoustic  sounder.  More  recently,  a joint  acc ostic  sounding  experiment  was  conducted 
with  the  W ave  Propagation  Laboratory  (WPL)  of  the  National  Oceanic  and  Atmospheric  Administration  (NOAA' 
Boulder,  Colorado,  when  the  sounder  was  operated  alongside  a ISO  m meteorological  tower  situated  at  Haswell, 
Colorado.  Good  progress  was  made  toward  interpreting  the  vast  amount  of  detail  contained  in  the  sounder  record 
in  terms  of  the  profiles  of  tempera'ure  and  wind  velocity,  and  the  turbulent  flux  of  heat  and  momentum  [Bean  and 
McAllister,  1971], 

During  this  series  of  experiments,  however,  there  were  some  problems  in  the  use  of  tower  data  to  interpret  the 
sounding  data.  First,  it  was  necessary  to  separate  the  tower  and  the  sounder  by  300  m to  avoid  echoes  from  the 
tower  structure;  hence  the  correlation  between  tower  and  sounder  data  was  not  always  good  during  nonstationary, 
atmospheric  processes.  Second,  the  levels  of  instrumentation  on  the  tower  were  limited  by  practical  considerations 
to  40  m,  95  m,  and  150  m;  frequently,  the  height  range  of  inteisst  lay  between  these  levels  and  no  tower  data 
could  be  obtained  for  this  range.  Third,  the  sounder  was  a more  sensitive  indicator  of  temperature  fluctuations 
than  were  the  temperature-sensing  elements  on  the  tower,  and  it  was  not  always  possible  to  investigate  the  structure 
recorded  by  the  sounder  under  very  stable  conditions  in  light  winds.  The  need  to  develop  the  acoustic  sounder  into 
an  independent,  quantitative,  meteorological  instrument  for  interpretation  of  fine  detail  recorded  during  nonstationary 
processes  was  thus  highlighted. 

Angle  of  arrival  {McAllister  et  al.,  1969]  and  Doppler  [Kelton  and  Bricout,  1964]  techniques  have  been  added 
to  the  sounder  to  obtain  two  independent  measurements  of  wind  velocity.  Here  we  discuss  these  techniques  and 
present  the  preliminary  results  cf  the  experiments. 
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LIST  OF  SYMBOLS 


a(  0 ) scattered  power,  per  unit  volume,  per  unit  incident  flux,  per  unit  solid  angle  at  an  angle  8 from  the 
initial  direction  of  propagation 

X acoustic  wavelength 

k wave  number  of  the  sound  wave 

Cv  structure  constant  of  the  wind 

C(  structure  constant  of  the  temperature 

C velocity  of  sound 

T absolute  temperature 

w horizontal  wind  velocity 

Va  total  wind  velocity  measured  by  anemometer 

Vq  total  wind  velocity  measured  by  angle  of  arrival  technique 

Vr  total  wind  velocity  measured  by  Doppler  technique 

Ap  wind  direction  at  height  of  anemometer 

A-0  wind  direction  measured  by  angle  of  arrival  technique 

A/  wind  direction  measuieu  by  Doppler  technique 

SCATTERING  OF  SOUND  IN  A TURBULENT  ATMOSPHERE 


The  propagation  of  sound  through  a turbulent  medium  has  been  studied  for  some  time,  and  the  work  of 
Ford  and  Meecham  [1960],  Tatanki  [1961  ] ,KalIistraiova  [1961] , and  Monin  (1962)  has  resulted  in  an 
equation  describing  the  scatter  of  sound  waves  by  tutbulent  fluctuations  of  temperature  and  of  wind  velocity. 
The  equation  is: 


o(0)  = O.O3*1'3  cos20 


C° 


cos' 


2 £ 


C? 


+ 0.13  — 


(1) 


where  o is  the  scattered  power,  per  unit  volume,  per  unit  incident  flux,  per  unit  solid  angle  at  an  angle  8 from  the 
initial  direction  of  propagation;  k is  the  wave  number  of  the  sound  wave;  and  Cv  and  Q are  the  structure 
constants  of  wind  ar.d  of  temperature  ar-'  are  functions  of  the  root  mean  square  difference  in  longitudinal  velocity 
(or  in  temperature)  for  two  points  separat  d by  unit  distance. 


Equation  >'  1 ' shows  that  the  scattered  power  is  the  sum  of  two  terns,  one  for  wind  fluctuations  and  the  other 
for  temperature  fluctuations.  The  wind  term  has  a multiplying  factor  of  cos2  (0/2),  which  means  that  the  wind 
fluctuations  produce  no  energy  in  the  backscatter  direction  that  is,  for  0 * 180°  . Hence,  all  the  eneigy  received  in 
the  backscattered  direction  results  from  the  interaction  of  the  sound  wave  with  turbulent  fluctuations  of  temperature. 
At  all  other  angles,  the  scattered  energy  is  the  sum  of  both  the  wind  and  temperature  terms. 


MEASUREMENT  OF  MEAN  WIND  VELOCITY 


The  vertically  directed  beam  from  the  acoustic  sounder  is  transported  horizontally  by  wind  because  the 
sound  energy  travels  as  a spherical  wave  relative  to  the  air  stream.  This  property  of  the  propagation  of  sound  is  used 
to  measure  the  mean  velocity  of  the  wind  up  to  the  height  of  the  scattering  volume. 


In  the  presence  of  a constant  horizontal  wind  w.  a sound  ray  transmitted  upwind  at  an  angle  0 to  the  verticil 
direction  given  by 


(where  C is  the  velocity  of  sound)  will  reach  the  scattering  layer,  vertically  above  the  transmission  point  After 
scattering,  the  sound  ray  drifts  horizontally  on  the  return  journey  arJ  arrives  back  at  the  receiving  point  from  the 
downwind  direction  at  an  angle  6 . Thus  the  angular  difference  between  the  transmitted  ard  received  rays  is  18 
in  the  presence  of  a constant  wind  If  the  horizontal  wind  is  not  constant  along  ihe  whole  path,  the  angular  difference 
between  the  two  rays  becomes  a function  of  the  integral  of  the  wind  velocity  along  tne  path 


MEASUREMENT  OF  ANGLE  OF  ARRIVAL 


The  angle  of  arrival  of  the  backscat'ered  energy  was  measured  using  interferometry  techniques.  Four  acoustic- 
arrays  were  used  in  this  series  of  experiments;  the  arrays  are  shown  mounted  on  a horizontal  turntable  in  fifure  1 . 

_ Each  array  consisted  of  49  loudspeakers  with  a 
. . X diameter  of  20  cm  arranged  in  a 7 X 7 square  array. 

JL ' **  ~ The  spacing  between  the  rows  of  loudspeakers  v/as 

- 22  cm.  Wooden  side  screens  were  fitted  to  each  array 

reduce  the  sidelobe  level  m the  horizontal  direction 
— .^7  !•  _ When  the  arrays  were  grouped  in  pairs  as  shown,  the 
distance  between  the  phase 
214c  1 5 A ) 


Two  receiving  arrays  were  place-.'  side  by  side 
and  the  phase  difference  between  the  energy  received 
In  the  two  arrays  was  measured  in  « ohase  detector. 
To  minimize  errors  introduced  by  receiver  noise-, 
localized  ambient  noise  sources,  and  other  factors,  the  mean  phase  difference  ’ etween  these  arrays  was  kept  at  zero 
by  electrically  steering  the  transmitted  beam  with  a servo  loop,  whi  -h  compeii^-teo  for  changes  introduced  oy  the 
wind.  The  phase  difference  between  the  two  transmuting  arrays  tha--  was  required  to  • eer  the  beam  was  a measure 
of  the  mean  velocity  of  the  wind. 


Figure  I.  - Two  oblique  arrays  and  two  vertical  arrays 
mounled  on  t hortzontM  turntable. 


In  the  present  case,  the  transmitted  energy  was  contained  within  a cone  defined  by  the  beam  width  of  the 
acoustic  array;  hence  the  backscattered  energy . which  arrived  back  at  ihe  receiving  array,  Fad  a sirdar  spread  in 
angle  of  arrival  about  the  boresight  direction.  The  tear  direction  of  arrival  was  obtained  by  -tvers-.l>ing  the  angle- 
of  arrival  measurements  over  a period  of  several  minutes.  Range  gating  techniques  '.ere  used  to  -.lefine  the  height 
range  of  the  measurement. 

The  acoustic  sounder  was  opera<ed  in  the  bistatic  mode  (figure  2(c))  ic  check  the  accuracy  of  the  angl*  of 
arrival  measurement.  The  pair  of  inclined  arrays  on  the  left  of  figure  I was  mounted  over  tne  vertical  a:.  - of  the 
turntable  and  inclined  at  20°  to  the  horizontal  as  shown  The  second  pair  of  arrays  was  mounted  or  a fixed  trestle. 
100  m away  toward  the  west  and  inclined  at  20°  to  the  horizontal,  facing  the  pair  on  the  turntable  as  shown  in 
figure  2(c).  This  symmetrical,  bistatic  configuration  gave  a scatteru'g  angle  of  |4C°.  and  the  scattered  pov  ei 
therefore  contained  components  arising  from  fluctuations  of  temperature  and  of  wind  velocity  (Eq  (I))  withm  <*-e 
common  volume  of  the  two  overlapping  beams.  The  mean  height  of  the  common  volume  was  137.5  m. 

The  turntable  was  rotated  in  3°  increments  through  a total  angle  of  15°.  Figure  3 is  a plot  of  the  turn  table 
azimuth  direction  versus  the  ph»se  change  introduced  by  the  servo  loop  to  steer  the  transmitted  beam  back  to  the 


original  direction.  Each  measurement  took  10  min; 
the  extreme  values  of  the  phase  angle  recorded  during 
this  time  are  plotted  in  figure  3,  with  theoretical  curve 
shown  (soliu  line)  for  comparison. 


EXPERIMENTAL  MEASUREMENT 
OF  MEAN  WIND  VELOCITY 


The  rour  acoustic  . *»  ys  were  arranged  as  shown 
in  figure  ».  The  noxt.'  est  and  north  east  arrays  were 
used  to  me;'  ” ft  the  west-east  component  ot  the  wind,  and 
the  north-  .-u  and  south-west  arrays  were  used  U,  measure 
the  north-south  component  of  the  wind.  The  south-east 
array  was  used  to  record  the  amplitude  of  t‘  ■ ; backscattered 
energy.  The  two  components  of  the  wind  velocity  were 
measured  by  time  sharing  the  measuring  equipment  be- 
tween the  north  pair  of  arrays  and  the  west  pair  of  arrays. 

Each  pair  was  alternately  switched  into  the  measuring 
equipment  before  successive  transmitted  pulses.  In  this 
way  it  was  possible  to  obtain  continuous  recordings  of  the 
northerly  and  easterly  components  of  the  wind. 

On  the  night  of  April  2, 1971 , a wind  shear  layer 
developed  within  the  radiation  inversion  and  the  structure 
recorded  by  the  sounder  was  similar  to  that  shown  in 
figure  10.  The  upper  limit  of  turbulent  breakdown  (top 
of  the  wind  shear  layer)  was  at  180  m.  The  range  gate 
was  set  between  90  and  150  m.  The  top  trace  in  figure 
4(a)  is  the  total  mean  wind  velocity  constructed  from 

the  northerly  and  easterly  components  recorded  by  the  sounder.  This  is  the  average  wind  velocity  up  to  a height  of 
120  m.  The  dotted  trace  is  the  wind  velocity  recorded  by  an  anemometer  at  a he:ght  of  10  m.  Note  the  excellent 
agreement  between  the  overall  shape  of  the  two  traces  and  the  higher  velocity  recorded  by  the  sounder.  Good 
correlation  between  these  two  traces  would  be  expected  because  the  mean  velocity  recorded  by  the  sounder  includes 
the  velocity  of  the  wind  at  the  surface.  In  figure  4(b),  the  solid  trace  shows  the  direction  of  the  wind  recorded  by  the 
sounder,  and  the  dotted  trace  shows  the  direction  recorded 
by  the  anemometer.  Unfortunately,  there  were  only  eight 
segments  on  the  wind  direction  indicator  mounted  on  the 
anemometer,  and  good  angular  resolution  was  not  possible; 
but  again,  the  similarity  in  the  two  traces  is  evident.  Signif- 
icant changes  in  the  direction  of  the  wind  with  height 
would  not  be  expected  through  a wind  shear  layer  of  this 
kind.  Therefore,  the  ability  to  record  faithfully  the 
direction  of  the  wind  under  these  conditions  leads  to 
confidence  in  the  angle -of- arrival  measurement. 


Figure  2.-  diagram  showing  modes  of  operation  of  acoustic 
sounder,  («)  Vertical  monostatic  mode,  (b)  Oblique 
monostatic  mode,  (c)  Bistatic  mode. 


In  figure  4(c),  the  total  wind  velocity  recorded 
by  the  sounder  (Vg)  is  plotted  against  the  wind  velocity 
recorded  by  the  anemometer  (V^  ).  Each  dot  represents 
a 1 5-min.  sample.  The  good  correlation  between  Vg 
and  Fj  is  seen  clearly. 


Figure  3.-  Curve  showing  response  of  angle  of  arrival 
feedback  loop  to  rotation  of  turntable  in  azimuth. 
The  full  line  shows  the  calculated  response. 
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DOPPLER  MEASUREMENT  OF 
WIND  VELOCITY 
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The  Doppler  spectrum  of  an  acoustic  signal 
scattered  from  isotropic  turbulence  has  been  discussed 
theoretically  by  Ford  and  Meecham  [ 1960]  and  in- 
vestigated experimentally  by  Kelton  and  Bricout  [1964] . 
Ford  and  Meecham  showed  that  the  maximum  of  the 
scattered  power  spectrum  is  shifted  from  the  incident 
frequency  by  an  amount  determined  by  the  Doppler 
shift  due  to  the  radial  component  of  the  mean  flow . 

The  shift  in  the  spectrum  of  the  scattered  power 
was  measured  by  passing  the  received  signal  through 
a fust -order  tracking  filter  in  which  the  capture  range 
was  set  at  ±15  Hz.  This  gave  a time  constant  of 
65  msec  for  the  tracking  filter  that  was  adequate  to 
track  a transmitted  pulse  of  80  msec  duration.  The 
output  of  the  tracking  filter  was  a voltage  that  was 
proportional  to  the  frequency  shift  of  the  scattered 
power  spectrum  relative  to  the  transmitted  frequency. 
Null  detection  techniques  were  used  again  (see  angle- 
of-arrival  measurements)  to  reduce  the  errors  intro- 
duced by  receiver  noise  and  narrowband,  ambient 
noise  sources.  The  output  of  the  tracking  filter  was 
used  to  control  the  carrier  frequency  of  the  trans- 
mitted pulse  in  such  c way  that  an  increase  in  the 
received  frequency  due  to  the  Doppler  effect  was 
compensated  by  a decrease  :n  carrier  frequency. 
Sufficient  gain  was  applied  around  the  servo  loop  to 
keep  the  output  of  the  tracking  filter  near  zero.  The 
time  constant  of  the  Doppler  servo  loop  was  set  at 
30  sec. 


Figure  4.-  Comp*rir.>n  between  total  wind  velocity  and  direction 
measured  by  an'Je  of  arrival  technique  (full  line)  and 
anemometer  (dotted  line)  at  a height  of  10  m. 

(a)  Total  wind  velocity. 

(b)  Direction,  1900C.S.T.  April  1, 1971  to  0700  C.S.T. 

April  2, 1971. 

(c)  Cor  elation  between  total  wind  vclrcity  measured  using 
an^le  of  arrival  technique  (Kg)  and  anemometer  (K^) . 


The  facsimile  record  of  thermal  plumes  shown 
in  0gure  5 was  obtained  with  the  vertical  monostatic 
configuration  of  the  sounder  (figure  2(a))  when  the 
sky  was  clear  and  the  wind  was  varying  between  calm 
and  5 m/sec.  This  is  a characteristic  sounder  record 
when  surface  heating  is  significant  and  the  wind  is 
light  and  variable.  The  range  gate  was  set  between 


120  and  150  m as  shown  by  the  parallel  lines  on  the 

facsir  ole  record.  Th<  top  trace  in  figure  5 is  the  vertical  component  of  the  wind,  drawn  to  the  same  time  scale  as  the 
fact  mile  record.  Note  the  correspondence  between  the  columns  of  intense  acoustic  returns  and  periods  of  upward 
m'  tion  measured  using  the  Doppler  technique.  Similarly,  downward  motion  is  seen  to  be  associated  with  the  relatively 
r ear  regions  between  the  thermal  columns.  The  peak  vertical  velocity  recorded  in  this  sample,  was  of  the  order  of 
1 .6  m/sec. 


The  facsimile  record  shown  in  figure  6 was  obtained  using  the  symmetrical,  bistatic  configuration  of  the  sounder 
(figure  2(c))  during  thermal  plui.te  activity.  It  will  be  noticed  that  the  height  scale  is  now  t onlinear,  since  time  delay 
(linear  scale  on  facsimile  record)  and  *■  are  not  linearly  related  in  the  bistatic  mode.  The  ellipsoidal  surface 
of  constant  time  delay,  which  passes  through  the  center  of  the  common  volume,  is  drawn  schematically  in  figure  2(c). 
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record  of  thermal  plumes.  1500-1600  C.S.7.  April  2,  1971.  Height 


of  range  gate  shown  by  parallel  lines  on  record. 


2-7 


*1 


I 


The  contras!  between  the  records  of  figures  5 and  o.  which  are  records  of  thermal  plume  activity,  is  striking  In 
figure  6.  the  clear  regions  between  the  thermal  plumes  (tig.  5 1 are  now  filled  in  with  returns  and  the  detail  is 
lost  The  additioi.il  energy  was  scattered  from  turbulent  fluctuations  »f  wind  velocity  since,  in  the  bistatic  mode 
of  operation,  the  scatter  ingle  was  140  (Eq  ( 1 )).  The  clear  regions  between  the  thermal  plumes  shown  in 
figure  5 (monostatic  mode)  indicate  that  the  fluctuations  of  temperatur  * were  small  in  the  descending  air  | Warner 
and  Telford.  1 l»f>7  ] 1 he  apparent  fluctuations  of  wind  velocity  in  this  descending  an  which  were  recorded  in 

the  bisutic  mode  (fig  hi,  probably  arose  from  mechanically  induced  turbulence. 

The  dit  'cullies  in  U5"tg  other  meteorological  instrumentation  to  interpret  the  qualitative  land  now  quanti- 
tative) data  receded  by  th<  sounder  and  the  need  to  develop  the  sounder  into  an  independent  instrument  have 
been  noted  The  approach  nas  been  to  ( 1 ) devise  independent  techniques  to  measure  each  parameter  simultaneously 
and  (2)  obtain  at  least  two  ii  dependent  streams  of  data  that  can  be  cross  checks  I for  error  with  the  hope  that 
confidence  eventually  will  be  built  up  in  the  results  of  the  measurements  through  self  consistency.  These  principles 
aie  illustrated  in  the  following  sections 


INDEPENDENT  DOPPLER  MEASUREMENTS  OF  VERTICAL  VELOCITY 


The  vertical  component  of  the  wind  velocity  was  measured  using  the  symmetrical  bistat.-  mode  of  sounder 
operation  (fig.  2(c))  Independent  streams  m data  were  obtained  by  reversing  the  direction  of  tnnsmission  before 
each  transmitted  pulse  The  west  array,  for  example,  was  alternately  switched  to  rite  transmitter  and  then  to  the 
receiver.  The  interleaved  data  so  obtained  were  separated  and  recorded  in  separate  channels.  The  upper  two  traces 
in  figure  7 are  the  two  charnels  of  vertical  wind  velocity.  The  third  face  Iro.n  the  top  is  the  difterence  between 
the  first  two  and  was  recorded  at  the  same  sensitivity.  The  good  • (relation  between  the  two  channeh  ol  vertical 
velocity  is  reflected  in  the  difference  channel  There  was  very  little  change  in  the  character  of  the  difference  channel 
from  1 300  to  1700  C.S.T..  April  7.  197',  even  though  there  were  significant  changes  in  the  velocity  channels.  The 
fourth  and  filth  traces  are  the  wind  velocity  and  temperature  measured  at  10  m from  the  ground  The  record  was 
obtained  after  a sea  breeze  had  developed  at  the  experimental  site  ( 10  km  from  the  coast).  The  temperature  trace 


i 


■ 


figure  7.  Correlation  between  independent  Doppler  measurement*  of  vertical  wind  velocity  ( Vr)  . Traces  I and  2 (from  lop). 
Vff  and  ; Trace  3.  )'rt  minus  Vfj-,  Trace  4,  anemometer  wind  velocity  (f'a);  Trace  S.  Temperature  (in  nt) 
1300-1700  C.S.T  April  7.  1971. 
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clearly  shows  the  decline  of  surface  heating  and  thermal  plume  activity  after  1600.  which  is  reflected  in  the  velocity 
traces  as  a decline  in  vertical  wind  velocity  after  1630.  The  mean  value  of  the  vertical  wind  between  1300  and  1500 
was  zero;  that  is,  there  was  no  net  upward  or  downward  flow  of  air  that  was  measurable.  The  rms  velocity  during 
this  period  was  0.5 1 m/sec  with  peaks  ranging  between  +i  3 m/sec  and  -1 2 m/sec. 


SIMULTANEOUS  MEASUREMENT  OF  THE  HORIZONTAL  WIND 
BY  ANGLE-OF-ARRIVAL  AND  DOPPLER  TECHNIQUES 


The  oblique  monostatic  mode  of  sounder  operation  was  used  to  obtain  the  horizontal  wind  vector  by  the  two 
independent  techniques  of  angle  of  arrival  and  Doppler.  One  pair  of  arrays  was  placed  at  right  angles  to  the  north- 
south  arm  (fig.  2(b))  and  the  boresight  direction  was  set  at  70°  to  the  horizontal.  A second  pair  of  arrays  was  placed 
at  right  angles  to  the  west-east  arm  and  inclined  at  70°  as  shown.  The  two  acoustic  beams  intersected  at  275  m above 
a point,  100  m from  each  pair  of  arrays.  Alternate  angle-of-arrival  and  Doppler  measurements  were  made  on  the 
north  and  eas  arms  to  obtain  measurements  of  both  components  of  the  wind.  This  was  done  by  interchanging  the 
north  and  east  pairs  of  arrays  before  the  transmitted  pulse. 


Angle-of-arrival  measurements  using  the  north 
arrays  gave  the  west -east  component  of  the  wind, 
while  Doppler  measurements  using  these  arrays  gave 
the  north-south  component  of  the  wind.  Similar 
measurements  using  the  east  arrays  gave  the  north- 
south  and  west-east  components  of  the  horizontal 
wind.  The  total  velocity  and  direction  of  the  wind 
were  computed  from  these  measurements  and  plotted 
with  the  wind  velocity  and  direction  at  10  m (fig.  S). 
The  measurements  were  made  during  the  night  when 
the  wind  was  strengthening  prior  to  the  passage  of 
a cold  front.  There  was  a deep  shear  layer  evident 
on  the  sounder  record  to  about  300  m.  The  range 
gate  was  set  within  this  shear  at  120  to  150  m.  The 
similarity  between  the  three  velocity  traces  is  apparent. 
Note  that  the  mean  wind  ( Vq ) to  135  m (angle-of- 
arrival  measurement)  was  closer  to  the  surface  value 
than  it  was  to  the  velocity  (Vr)  at  135  m (Doppler 
measurement),  suggesting  that  the  wind  shear  was 
not  constant  up  to  1 35  m but  increased  with  height 
toward  this  level. 

There  was  very  good  agreement  between  the 
three  methods  of  measuring  ihe  wind  direction. 

Scatter  diagrams  of  total  wind  velocity  were 
constructed  from  1 ,5-min.  samples  as  shown  in 
figure  9(a),  (b),  and  (c).  The  correlation  between 
each  of  the  velocity  measurements  is  shown  to  be 
very  close. 


Figure  8.-  Correlation  between  three  methods  ol'  measuring  total 
wind  velocity  and  direction,  2000-2400  C.S.T.  April  5, 1971. 

(a)  Wind  velocity  - Vr  , Vg  , and  Va  . 

(b)  Wind  direction  -A/  , Az6  , and  AjA . 
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Figure  9.-  Scatter  plots  of  data  from  figure  8(a). 
(a)  V6  and  Va  ; (b)  Vr  and  Va  ; (c)  VT  and  Vg 
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VERTICAL  GRADIENT  OF  WIND  VELOCITY 


The  oblique  monostatic  configuration  of  the  sounder  (fig.  2(b))  was  used  to  measure  the  vertical  gradient  of  the 
wind  velocity.  Two  arrays  were  mounted  on  the  horizontal  turntable  and  inclined  at  an  angle  of  20°  to  the  horizontal 
(fig  I ).  The  turntable  was  rotated  until  the  boresight  direction  of  the  arrays  pointed  directly  downwind  The  height 
of  the  Doppler  range  g3te  was  switched  alternately  between  two  adjacent  levels.  30  m apart,  to  obtain  the  wind  velocity 
at  each  level.  The  difference  between  these  two  Doppler  measurements  of  horizontal  wind  velocity  was  recorded  to 
obtain  the  gradient  over  a thickness  of  30  m. 


The  facsimile  record  shown  in  figure  10  was  obtained  when  a wind  shear  layer  developed  within  a radiation 
inversion  during  'he  night  of  April  5, 1971 . The  wind  velocity  at  the  surface  increased  from  4.5  m/sec  at  2100  CST 
to  6.o  m/sec  at  2200  CST  and  the  temperature  decreased  by  0.5°C  During  the  same  period,  it  will  be  seen  that  the 
top  of  the  shear  layer  decreased  from  250  to  1 50  m.  Twenty-minute  averages  of  the  wind  velocity  at  each  level  were 
obtained  as  shown  in  figure  10  before  the  mean  height  of  each  gate  was  decreased  to  keep  within  the  shear  layer  The 
average  value  of  the  wind  velocity  recorded  in  each  range  gate  tor  each  20-min  period  was  plotted  above  the  sounder 
record  in  figure  10.  The  wind  velocity  recorded  by  the  anemometer  at  10  m above  the  ground  was  also  plotted  The 
vertical  gradient  measured  by  the  sounder  was  2 m/sec/ 100  m for  each  20-min  sample 


200 

HEIGHT 
iN  METRES 


200  220C 

figure  10.-  Top  - Wind  profiles  constructed  from  surface  anemometer  and  Doppler  measurements  of  wind  velocity. 

Bottom  - Simultaneous  inclined  monostatic  sounder  record  showing  descer.ding  shear  layer,  2100-2200  C.S.T.  April  5.  1971 
Height  of  range  gan  shown  by  parallel  lines  on  record. 


The  wind  velocity  could  not  be  measured  at  more  than  two  range  gate  levels  with  the  present  equipment,  but  it 
is  clear  that  if  more  levels  were  available,  or  if  the  meteorological  conditions  were  sufficiently  stationary,  it  would 
have  been  possible  to  measure  the  wind  profile  from  the  surface  to  the  top  of  the  wind  shear  layer. 


CONCLUSIONS 


The  potential  of  the  acoustic  sounding  technique  for  atmospheric  research  and  operations  has  been  appreciated 
for  some  time,  but  much  of  the  detail  of  atmospheric  motions  contained  in  the  sounder  record  still  remains  to  be 
interpreted.  A fundamental  difficulty  is  the  inability  to  make  meteorological  measurements  in  the  same  volume  of 
space  that  is  interrogated  by  the  sounder,  and  progress  to  date  has  been  limited  for  this  reason. 

Angle-of-arrival  and  Doppler  techniques  were  applied  to  the  measurement  of  wind  velocity  as  a first  step  toward 
developing  the  acoustic  sounder  into  an  independent  meteorological  tool.  The  preliminary  results  of  experiments 
were  very  encouraging.  Good  correlation  was  obtained  between  the  wind  velocity  and  direction  measur  ;d  by  two 
independent,  acoustic  sounding  techniques,  and  the  wind  velocity  and  direction  recorded  by  an  anemor  teter  at  the 
surface.  The  agreement  between  the  three  measurements  of  wind  direction  was  particularly  good  when  the  facsimile 
record  showed  the  atmospheric  structure  to  be  relatively  simple. 

The  ability  to  measure  the  wind  field  in  the  first  kilometer  or  so  of  the  atmosphere,  remotely  from  the  ground, 
seems  to  be  within  reach.  The  feasibility  of  measuring  the  mean  profile  of  temperature  in  the  lower  atmosphere  by 
angle-of-arrival  measurements  in  the  vertical  plane  is  being  examined. 
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DISCUSSION 


Peterson:  How  do  you  obtain  profiles? 

McAllister:  Although  we  are  range  gating  at  one  level,  we  could  gate  at  different  levels  for  profiling. 

Reinisch.  Could  you  sound  at  different  frequencies  to  sound  different  heights? 

McAllister:  The  scatter  cross  section  is  a slow  function  of  frequency,  f'3  . Absorption  of  sound  is  a more 

sensitive  function  of  frequency  and  relative  humidity. 

Parker:  To  do  the  inverse  problem  you  must  first  be  able  to  do  the  direct  problem.  Given  a wind  profile  do 

you  have  the  theory  to  predict  the  experimental  data? 

McAllister:  Yes,  but  the  turbulent  medium  is  a complication,  although  a second-order  effect. 


Might:  Since  you  are  depending  on  scatter,  rather  than  reflection,  selection  of  the  turbulence  scale  sizes  should 

be  based  on  the  wavelengths  used.  This  may  have  a strong  frequency  dependence.  Have  you  done  multiple 
wavelength  studies? 


McAllister:  We  have  looked  as  high  as  1 .5  KHz.  Qualitatively  there  is  no  change  from  800  Hz.  Since  the  frequency 

dependence  is  small,  we  would  need  a factor  of  10  change  in  frequency.  Then  absorption  changes  would  be 
important. 

Reinisch:  If  amplitude  measurements  at  different  frequencies  were  made,  information  on  the  type  of  scatterer 

would  be  revealed.  Then  perhaps  one  frequency  could  be  used  for  profiling. 

Bofarski:  Do  you  have  phase  data? 

McAllister:  We  have  relative  phase  data  between  the  receiving  arrays. 

McOatchey:  In  our  problem  - turbulent  scatter  of  laser  radiation  - ways  of  describing  the  turbulence  is  a 

key  unknown.  You  need  detailed  wind  and  temperature  structure.  Therefore,  the  turbulence  should  be 
defined  statistically,  and  an  inversion  technique  is  needed  that  yields  statistical  parameters. 

McAllister:  Provided  the  process  is  stationary,  which  is  often  not  true. 
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ABSTRACT 


Monostatic  and  bistatic  radar  techniques  for  the  measurement  of  the  structure  of  volume  targets  in  the 
troposphere  and  lower  stratosphere  are  reviewed.  The  targets  considered  are  thin  turbulent  layers  in  the  lower 
stratosphere  and  rain  in  the  troposphere.  Examples  are  given  of  measurements  of  both  types  of  atmospheric 
scatterers  taken  with  the  Millstone  Hill  L-band  radar  and  the  Avon-to-Westford  bistatic  scatter  systems. 

The  measurements  of  scattering  from  thin  turbulent  layers  show  that  layers  are  generally  detected  at  or  near 
the  tropopause,  and  in  31  out  of  34  sets  of  measurements,  layers  were  detected  above  the  tropopause  in  the  lower 
10  km  of  the  stratosphere.  The  layers  all  appeared  to  be  widespread,  covering  hundreds  of  km  in  horizontal  extent. 

The  intensity  of  the  layers  varies  in  time  and  space  with  significant  variations  in  tens  of  minutes  and  horizontal 
distances  the  order  of  20  km.  The  threshold  for  turbulent  layer  detection  corresponds  to  an  equivalent  C\  thickness 
product  of  10~13ml/3  at  a range  of  100  km  and  for  lay eis  with  less  than  103m  thickness. 

The  measurement  of  scattering  by  rain  shows  that  in  the  New  England  area  both  convective  and  widespread  rain 
consists  of  a number  of  small  cells.  On  average,  the  cells  appear  to  have  a half-intensity  width  of  3 to  4 km  as  measured 
with  a radar  system  with  a 1 .8  km  resolution  cell  size  for  cells  at  100  km  range. 
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INTRODUCTION 


Conventional  radars  have  been  used  for  many  years  for  the  study  of  precipitation  processes  [Atlas,  1964J . In 
the  past  decade,  as  a result  of  advances  in  tzdar  techniques  and  devices,  high-power  and  high-resolution  radars  have 
been  made  available  to  radar  meteorologists  for  use  in  probing  the  atmosphere.  These  radars  have  been  useful  in  the 
direct  probing  of  two  different  atmospheric  processes:  turbulence  as  revealed  by  fluctuations  in  the  index  of  refraction 
of  the  atmosphere  and  rain.  High-power,  high-resolution  radars  such  as  the  Millstone  Hill  radar  or  the  Wallops  Island 
radars  are  required  for  the  study  of  turbulence  because  of  the  relatively  small  radar  cross  section  per  unit  volume  pro- 
duced by  refractive  index  fluctuations.  High-resolution  radars  are  also  useful  for  the  study  of  rain  since  rain  gene,  .lly 
occurs  in  cells  with  limited  horizontal  extent. 

The  radar  measurements  of  turbulence  a. id  rain  presented  here  were  made  with  the  Millstone  Hill  L-band  radar. 
The  data  are  all  the  results  of  direct  measurements  in  that  the  object  of  the  measurement  program  was  the  cross 
section  per  unit  volume  as  presented.  Inversion  techniques  were  not  used  except  in  their  simplest  form  in  the  prepa- 
ration of  the  data.  The  results  of  the  measurements  are  of  interest  because  they  illustrate  the  spatial  variability  that 
is  to  be  expected  in  meteorological  processes.  The  rain  measurements  have  additional  significance  for  users  of  passive 
microwave  systems  since  rain  is  a source  of  error  in  atmospheric  sounding  for  the  construction  of  temperature  or 
humidity  profiles. 
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THE  RADAR  SYSTEM 

The  parameters  of  the  radar  system  are  given  in  table  1 . The  radar  system  has  a resolution  volume  at  100  km 
that  may  be  approximated  by  a cylinder  with  a 1 .8  km  diameter  and  a 1 .8  km  length.  The  system  has  sufficient 
sensitivity  to  detect  refractive  index  fluctuations  due  to  turbulence  in  the  lower  stratosphere  and  fair  weather  cumulus 
clouds  at  100  km. 


TABLE  1-  PARAMETERS  OF  THE  MILLSTONE 
HILL  L-BAND  RADAR 
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Rain  and  refractive  index  fluctuations  are  extended 
targets  in  space.  However,  these  targets  are  inhomogeneous, 
and  the  radar  therefore  provides  an  estimate  of  the  cross 
section  of  the  target  as  averaged  over  the  radar  beamwidth 
and  one-half  the  pulse  length.  The  radar  data  presented 
here  have  been  scaled  by  the  effective  integration  volume  of 
the  radar  system  and  are  reported  as  cross  section  per  unit 
volume 

The  radar  measurements  reported  have  been  made  as 
a part  of  a program  of  microwave  propagation  research, 
which  also  includes  measurements  of  bistatic  scatter  by 
rain  and  refractive  index  fluctuations  and  passive  radio- 
metric  measurements  of  rain  [Oane,  1970  a;  Crane,  1968] . 
The  bistatic  measurements  were  made  at  X-band  using  a 
145  km  scatter  path  between  a transmitter  in  Avon, 
Connecticut,  and  the  Westford  Communications  Terminal; 
the  parameters  cf  this  system  are  given  in  table  2.  The 
radiometric  measurements  were  made  using  a X-band 
radiometer  and  the  Haystack  antenna  system;  the 
parameters  of  this  sysiem  are  given  in  table  3.  Measure- 
ments with  the  radar  and  at  least  one  of  the  other  two 
systems  were  always  made  simultaneously  with  the  receiver 
antennas  scanned  in  synchronism. 


MEASUREMENTS  OF  REFRACTIVE  INDEX  FLUCTUATIONS 


The  models  for  scattering  by  refractive  index  fluctuations  presented  by  Tatanki  [1961 J indicated  that  bistatic 
scattering  by  turbulence  should  be  one  of  the  dominant  mechanisms  for  transhorizon  scatter  propagation.  The  same 
model  also  predicted  that  high-power  radar  systems  operating  *n  the  10-  to  50-cm  wavelength  region  should  detect 
scattering  from  turbulence.  The  high-power  S-band  radar  system  at  Wallops  Island  provided  some  of  the  first  data  to 
verify  the  prediction  [Hardy  etal. , 1966] . Measurements  with  the  Millstone  Hill  L-band  radar  [Oane,  1970  ft]  and 
other  installations  have  also  verified  the  existence  of  scattering  from  turbulence. 

The  model  presented  by  Tatarski  related  the  scattering  cross  section  per  unit  volume  to  C\  , the  structure  constant 
for  turbulent  fluctuations  of  refractive  index  and  a measure  of  their  intensity: 


0i(*)«O.378X-1,iC«  (sin 

0|  (d)  cos1  <t>  (1) 


k 


i 


2-14 


where 


TABLE  2.-  AVON-WESTFORD  X-BAND  SCATTER 
PATH  PARAMETERS 


(3  cross  section  per  unit  volume 

1,  II  polarization  perpendicular  1 and  parallel  II  to 

the  plane  of  scattering 
X wavelength 

0 scattering  angle 

The  model  shows  that  for  forward  scatter,  4>  small, 
the  scattering  cross  section  per  unit  volume  is  much 
larger  than  for  radar  measurements,  </>  = n . This 
result  is  shown  in  the  experimental  data. 

Tatarski  went  further  and  proposed  a meteorolo- 
gical model  fo'  the  estimation  of  Cf,  given  profiles 
of  wind,  temperature,  and  water  vapor.  In  its  simplest 
form,  the  model  indicates  that  Cji  is,  to  first  order, 
proportional  to  the  square  of  the  gradient  of  absolute 
humidity  in  the  lower  atmosphere  or  of  potential 
temperature  at  heights  at  or  above  the  tropopause 
and  inversely  proportional  to  the  two-thirds  power 
of  the  magnitude  of  the  vector  wind  shear.  Compu- 
tations based  on  this  model  and  the  use  of  measured 
profile  data  show  that  in  the  region  at  and  above  the 
tropopause,  the  Cn  values  generally  are  higher  in 
regions  with  strong  gradients  in  potential  temperature. 
The  radar  measurements  presented  in  figures  1 through 
7 are  consistent  with  this  observation.  The  apparent 
thinness  of  the  thin  turbulent  layers  as  shown  in  these 
figures  indicate  that  the  regions  with  strong  potential 
temperature  gradients  are  rather  thin  and  profiles  of 
temperature  will  have  marked  small-scale  changes  in 
their  average  properties. 
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TABLE  3.-  RADIOMETER  PARAMETERS 
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The  radar  measurements  shown  in  figures  1 
through  7 were  made  with  the  system  described  in 
table  1 . This  system  differs  from  that  typically  used 
for  meteorological  measurements  due  to  the  use  of  a 
general-purpose  digital  computer  as  an  integral  part 
of  the  receiver.  The  computer  detects  the  radar 
returns,  incoherently  averages  the  returns,  and 
calibrates  the  receiver  system  using  signals  from  a 
noise  tube  that  is  fired  after  each  data-taking 
operation,  «nce  per  transmitted  pulse.  To  improve  the  target-detection  capability  of  the  radar,  the  recorded  data 
consists  of  the  incoherent  average  of  the  signal-plus-notse  minus  the  average  of  the  noise.  The  recorded  data  for  each 
elevation  scan  is  further  computer  analyzed  to  provide  incoherent  averages  of  the  received  signal  over  22.5  km 
horizontal  distance  interval  at  a constant  height  The  results  of  a series  of  the  horizontal  averages  at  different  heights 
is  then  assembled  into  a profile  of  C\  versus  height  as  shown  in  figure  1 ; as  used  here  is  found  by  solving 
Eq.  (1 ) given  the  radar  measurement  of  (3  . The  corresponding  computer-generated  RHI  display  is  shown  in  figure  2. 
As  an  aid  in  data  editing,  selected  elevation  scans  are  also  recorded  in  the  conventional  manner  by  photographing  a 
PPI  display.  The  PPI  display  that  corresponds  to  the  scan  used  .o  generate  figures  1 and  2 is  shown  in  figure  3. 
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Figure  C profiles  for  2 August  1968,  1720  GMT,  253" 
azimuth  made  using  the  Millstone  Hill  radar. 
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The  radar  system  was  operateu  to  record  data  at 
distance*  between  80  and  1 50  km  from  the  radar  and 
heights  between  5 and  30  km.  At  100  km  range  and  at 
the  elevation  angles  used,  the  effective  integiation  volume 
of  the  radar  is  approximately  1.1  km  in  height.  1.1  km  in 
horizontal  distance  normal  to  the  plane  of  the  radar  and 
scattering  volume,  and  1 s.Tt  in  horizontal  distance  in 
the  plane  of  the  radar  and  scattering  volume.  The  height 
resolution  capability  of  the  radar  at  100  km  is  I 8 km 
which  corresponds  to  the  10  dB  beamwidth  of  the 
antenna.  This  indicates  that  thin  layers  closer  together 
than  1 .8  km  in  height  will  not  be  resolved  t d.  for  layers 
thinner  than  l.l  km,  the  measured  layer  strength  will  be 
a function  both  of  the  layer  thickness  and  C p . Since 
most  of  the  layers  probed  by  a U-2  aircraft  used  in  one 
part  of  the  measurement  program  were  the  order  of  100  m 
thick,  it  is  best  to  view  the  layer  strci.gtn  measurements  as 
providing  the  value  of  the  height  p.oduct  li  the 
value  for  the  81.1  to  103.4  km  average  would  be  4X10~15 


layer  at  14  km  in  figui  r I were  100  m thick,  the  actual 
m_1'3(2X10~l  ®cm-3'3).  A homogeneous  layer  that  is  thinner  than  I km  would  have  different  Cp  ralun  reported 
because  the  percentage  of  the  beam  filled  depends  on  the  distance  from  the  radar.  For  thin  layers  ;he  ratio.-  of  the 
measured  layer  intensities  are  1 : 1, 25:1. 50:1.  respectively,  for  the  81.1-  to  10  • 4- , 104.5-  to  127.00- , and  128.0-  to 
150.5-km  distance  intervals.  The  relative  magnitudes  of  the  Cp  profiles  for  the  layer  at  4 km  shew  that  if  the  layer  were 

homogeneous,  it  would  be  thinner  than  1 km  and  the 
value  reported  is  a lower  bound  on  the  layer  strength 
Unfortunately,  the  majority  of  profile  measurements 
show  horizontal  variations  in  layer  strengths  and  estimation: 
of  layer  thickness  based  upon  this  technique  are  dubious 

The  radar  system  has  been  calibrated  so  that  the 
measured  Cp  value  for  a layer  that  fills  the  beam  is  w:tiun 
±1 .5  dB  of  the  actual  value.  The  stability  of  the  radar, 
the  characteristics  of  the  radar  target,  and  the  pufse-by- 
pulse  calibration  system  used  in  making  the  measurements 
provides  a relative  rms  enor  of  less  than  ±0.7  dB.  The 
major  sour.es  of  error  in  making  the  Cp  profile  measure- 
ments are  interference  from  other  radars  operating  in  the 
same  frequency  bend,  reflections  from  fixed  ground 
targets  detected  through  the  side  lobes  of  the  antenna 
pattern,  and  refections  from  urcraft  detected  through 
the  side  lobes.  Each  error  source  is  monitored  during 
the  measurement  program.  Interference  is  monitored 
by  the  relative  magritude  of  the  variance  of  the  receiver 
noise  temperature  which  is  continuously  measured.  The 

80-  to  ISO-km  measurement  range  was  dictated  by  ground  clutter  problems.  Fixed  targets  and  aircrafts  at  ranges  between 
80  and  ISO  km  must  be  manually  detected  using  the  computer-generated  RHI  displays.  Data  contaminated  either  by 
interference  or  extraneous  targets  were  rejected. 
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Figure  2.-  Computer  generated  RHI  display  for  2 August  1968. 
1720  GMT.  233'  azimuth. 
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The  L-band  radar  used  can  readily  detect  turbulent  layers  with  a Cp  height  product  greater  than  1 X I0-1  ‘ra'1  3 
IcmfSXlO-1  *cnra'3km).  Unfortunately,  a cloud  of  the  same  dimension  and  a Z of  6X I0X  lO^mmVm3  (see  Eq. 
(2))  for  liquid  particles  or  a Z of  3XI0_3mms/m3  for  ice  particles  will  give  the  same  return  This  corresponds  to  a 
cloud  of  liquid  particles  with  a liquid  water  content  the  order  of  0.005  g/m3  The  detectic  of  thin  turbulent  layers 
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is  theiefore  complicated  by  masking  by  cloud  layers. 

Foi  this  reason,  only  layers  in  the  clear  atmosphere 
may  be  detected.  Since  a single-frequency  radar  is 
used,  auxiliary  means  must  be  used  to  determine  if 
clouds  are  present.  For  layers  near  the  tropopause, 
the  presence  of  cirrus  clouds  must  be  determined. 

The  separation  between  thin  turbulent  layers  and 
citrus  cloud  layers  has  been  made  using  several  criteria. 
If  the  layer  is  detected  above  the  tropopause  in  a 
region  where  the  radiosonde  data  show  a definite 
lack  of  moisture,  the  layer  is  assumed  to  be  turbulent 
If  the  layer  is  thin,  on  the  order  of  the  resolution 
volume  of  the  radar,  and  is  thin  throughout  the 
duration  of  the  test,  it  is  assumed  to  be  turbulent. 

If  no  cirrus  clouds  are  visually  observed  by  pilots, 
ground-based  observers,  and  the  available  TPQ-1 1 cloud 
radars  at  the  height  of  the  layer,  the  layer  is  assumed 
to  be  turbulent.  Whenever  doubt  existed  as  to  the 
origin  of  the  radar  return,  the  layer  was  assumed  to 
be  caused  by  clouds. 

Using  the  above  layer-detection  criteria,  the 
data  from  34  days  of  measurements  randomly  dis- 
tributed between  January  1968  and  August  1968 
were  examined  for  the  existence  of  thin  turbulent 
layers  at  heights  above  the  tropopause.  Th<;  height 
of  the  tropopause  was  determined  using  the  radiosonde 
data  closest  both  in  time  ard  space  to  the  meas  -rement 
volume.  Two  types  of  layers  are  reported,  persistent 
and  transient.  A persistent  layer  is  * layer  at  a specific 
height  that  is  present  in  more  than  half  the  elevation 
scans.  The  measurements  were  made  by  making  two 
elevation  scans  at  a particular  azimuth;  then  moving 
the  antenna  to  a new  azimuth  for  another  set  of 
elevation  scans.  The  azimuth  used  for  the  measure- 
me:>.s  changed  from  day  to  day.  Typically  a series 
of  measurements  were  made  at  five  separate  azimuths 
spaced  over  a 1 80°  interval  interlaced  with  a series 
of  measurements  at  azimuths  spaced  over  a 20° 
azimuth  interval.  With  the  azimuth  stepping  methc  J 
used  for  data  taking.  * layer  had  to  be  present  over  a 
range  of  a zim.iu  values  to  be  classified  as  persistent. 

A series  of  measurements  made  on  May  28. 1968 
is  shown  in  figures  4 and  S.  In  this  set  of  data,  a 
persistent  layer  existed  at  the  tropopause  level.  The 
layer  at  14-km  height  i»  also  classified  as  persistent 
since  it  is  present  in  more  than  half  the  elevation  scans. 
As  shown  by  the  azimuths  lor  which  the  scans  were 
made,  the  persistent  layers  are  not  confined  to  a single 
azimuth  sector  but  are  detected  at  a wide  variety  of 
azimuths.  These  layers  extend  more  than  300  km 


Figvrs  2.-  RHI  display  for  2 August  1968,  I7Z0GM1  233' 
azimuth  made  using  ihc  Millstone  Hill  radar 


0- 

60  80  100  120  140  160 

SUBFACC  DISTANCE 


Figure  4 - Computer  generated  RHI  display  Tor  28  May  1968. 
1439  GMT,  120°  azimuth 


.tor-— 

»■ 

:»• 


azimuth  (msi 

a i iiimn»Tinuii 


* 

y io 


Htlji  l*tt»ttd»*tBH4nnrTnttTTiiT  * * 

■Mill  !IIIIII'IIIIlIII,|liii!l!l|.1.,. 

] ll  n-i  ■ inniiiimiHii  niiiiiiiiii  . 


I I » 

I Z * 10* 
I 4 » 10"* 
CM*  Aw  lor«'« 


Car.*  Clou— 


ISC  J 


i«oo  iroo 

TIME  (OMT) 


'•00 


Figure  5.-  Radar  detected  (tmospherv  layer  slrudu'c  for 
28  May  1968. 
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% horizontally.  The  intensity  of  the  layeis,  however,  changes  in  time  and  position.  The  intensity  of  the  layer  at  tropopause 

> height  in  figure  5 fluctuates  in  time  with  approximately  a 1-hr  period.  The  data  on  figure  5 also  show  three  transient 

layers  at  heights  above  the  tropopause.  Again,  these  transient  layers  tend  to  be  widespread  in  horizontal  extent  but 
£ weaker  in  intensity. 

£ The  data  for  each  of  the  measurement  days  are  presented  in  table  4.  The  results  aie  given  hy  dav.  The  table  also 

,i  gives  the  number  of  layers  per  2.5-km  height  interval  for  heights  above  10  km  and  the  total  number  of  layers  detected 

above  the  tropopause  for  each  day.  The  data  generally  show  a layer  just  above  the  tropopause.  The  layer  probably  is 
t associated  with  the  tropopause  but,  due  to  the  uncertainty  in  both  the  existence  and  location  of  a tropopause,  it  is 

hard  to  make  a definite  judgment  about  such  an  association.  The  feature  of  the  radiosonde  profiles  most  often  associated 
i.  with  the  layers  that  occur  above  the  tropopause  is  - temperature  inversion, 

v' 


TABLE  4.-  TURBULENT  PERSISTENT  (P)  AND  TRANSIENT  (T)  LAYER  DETECTIONS 
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Table  IV 

Turbulent  Layer  Detection* 


Uycn  I Layer!  from  RHI  Date 


Date 

1968 

Time 

CMT 

Max. 

cloud 

level 

km 

Tropo- 

pause 

Height 

km 

above 

tropo- 

pause 

P T 

10-12.5 

km 

P T 

12.5-  IS 
km 

P T 

15-  17.5 
km 

P T 

17.5-20 

km 

P T 

20  - 22. 5 
km 

P T 

>22.  5 
km 
P T 

1/9 

1800-2100 

6 

12.9 

1 

1/11 

1420-2050 

- 

11.0 

i 

1 

1 

2/5 

1720-2130 

2 

6.  5 

i 

1 

2/9 

1400-2050 

6 

10.0 

i 

1 

2/ ‘3 

1340-1850 

6 

7.3 

1 4 

1 1 

t 

1 

1 

1 

2/19 

1810-2100 

5 

7.6 

2 

1 

1 

3/7 

1410-2050 

1 

9.5 

6 

1 

1 

1 

1 

2 

3/1* 

1410-1840 

6 

11.5 

1 2 

1 

1 

1 

3/20 

1400-2100 

10 

11.8 

2 

1 

1 

1 

3/21 

1730-2100 

>u 

.2.6 

1 

l 

1 

3/22 

1340-1700 

10 

12.5 

1 2 

1 

1 

1 

4/4 

1800-2130 

u 

12.5 

1 

1 

1 

4/15 

1330-2120 

9 

ll.S 

2 

t 

1 

l 

4/24 

1600-1800 

9 

12.0 

1 

1 

1 

5/b 

1310-1900 

2 

12.7 

l 

5/16 

1310-1710 

9 

12.5 

1 

1 

1 

5/20 

1440-1830 

2 

10.5 

1 1 

1 

1 

1 

5/28 

1400-1953 

9 

12.  5 

1 J 

1 

1 

1 

1 

1 

5/29 

1300-2030 

II 

12.  3 

5 

1 1 

2 

1 

1 

6/3 

1410-1950 

10 

10.4 

2 

1 1 

1 

6/5 

l 550-1930 

0 

12.8 

1 

1 

1 1 

6/li 

1350-1720 

12 

14.8 

2 

1 

2 

6/27 

1230-630 

1 1 

14.8 

3 

1 

I 

1 

1 

6/28 

1240-1600 

8 

12.  7 

1 

1 

1 

7/29 

1430-2020 

3 

13  8 

1 3 

1 

1 

1 

1 

1 

1 

7/30 

1340-1950 

1 1 

12.4 

2 

1 

1 

1 

7/31 

1230-2020 

11 

12.8 

1 1 

1 

1 

| 

8/1 

1330-0100 

9 

14.  5 

2 

1 

1 

1 

1 

6,2 

1400-2020 

5 

13.8 

1 

1 

1 

1 

8/5 

1220-2030 

8 

13.  1 

1 

l 

1 

1240-2100 

9 

14.2 

2 

1 

1 

1 

\ 

t 

„/  7 

1230-0100 

9 

14.4 

1 

6/8 

1240-2020 

7 

14.  1 

2 

1 1 

! 

1 

1 

8/ 9 

1300-2020 

9 

12.6 

1 1 

i 

1 

Total 

Total  of  107  layer 

9 57 

12  14 

18  13 

3 20 

14 

2 5 

4 

34 

OetCClCG  ADOVI  IU  Rffl 

1 

66 

26* 

31 

23 

14 

7 

4 

Total  it  low  dut  to  rejection  of  any  layer  with  clouds. 


v 


| 


2-18 


Transient  layers  have  been  detected  at  heights  up  to  10  km  above  the  tropopause.  The  maximum  height  for  a 
layer  detection  is  25  km.  Weaker  layers  probably  exist  above  this  height  but  the  radar  system  sensitivity  is  not  high 
enough  to  detect  them.  The  transient  layers  may  also  appear  persistent  to  a more  sensitive  radar. 


Simultaneous  bistatic  scatter  measurements  were  also 
using  the  Avon-to-Westford  scatter  system  outlined  in  table 
ments  are  compared  in  figures  6 and  7.  Figure  6 shows  the 
the  results  of  scanning  the  Westford  antenna  in  elevation 
while  pointing  along  the  great  circle  path.  The  measure* 
ments  shown  by  the  points  show  the  layered  structure. 

The  arrows  point  to  the.  locations  of  layers  that  also  exist 
in  the  radar  data.  The  horizontal  dotted  lines  show  the 
transmission  loss  value,  or  ratio  of  transmitted-to-received 
power,  ( PtIPr ) , computed  using  the  bistatic  radar  equa- 
tion, the  scattering  angle  dependence  given  in  Eq.  (1),  and 
the  radar  estimate  of  Cn  ■ At  elevation  angles  below  5° 
agreement  is  evident  between  the  computed  and  measured 
values.  Above  5°,  coupling  via  the  sidelobes  of  the 
Westford  antenna  becomes  important  and  the  computation 
model  br«*ks  down.  The  data  show,  however,  that  the 
model  proposed  by  Tatarski  that  relates  C«  and  0 is 
useful  in  the  interpretation  of  scattering  from  thin  tur- 
bulent layers. 


MEASUREMENTS  OF  RAIN  SHOWER 
STRUCTURE 


made  of  scattering  from  refractive  index  fluctuations 
2.  The  radar  measurements  and  bistatic  scatter  measure- 
Cn  profile  as  measured  by  the  radar.  Figure  7 shows 


Millatona  Hill  L-Bond  Rodor 
Cl  Profile  Meoturoment 


2 Aug  1968 
220*  Azimuth 


1630-1636  GMT 


Figure  6.-  C£  profiles  for  2 August  1968, 1630  GMT, 
220°  azimuth  made  using  the  Millstone  Hill  radar. 


The  scattering  cross  section  for  rain  is  related  to 
Z,  the  sum  of  the  sixth  power  of  the  drop  diameters  by 
Rayleigh  scatter  theory: 

0l(*)=”5lK|2Z 

0| (0)  * 0jfo)  cos2  <t>  (2) 

where  !xl2  * |(e-l)/(e+2)|2  , e * dielectric  constant  of 
water,  and  the  other  parameters  are  defined  in  Eq.  (1). 
Equation  (2)  is  valid  for  frequencies  below  10  GHz.  The 
parameter  Z is  a function  only  of  the  drop-size  distribu- 
tion and  may  be  considered  to  be  a meteorological  param- 
eter that  is  descriptive  of  rainfall  intensity.  The  relation- 
ship between  Z and  the  rainfall  rate  R is  a statistical 
one  that  depends  on  the  properties  of  drop-size 
distributions.  The  relationship  between  Z and  R 
normally  used  by  radar  meteorologists  is  Z*  200R1'6 
when  Z is  expressed  in  mm6 /m3  and  R inmm/h. 
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Figure  7.-  Transmissk  i loss  vi  receiver  elevation  angle, 
8.0°  transmitter  elevation  angle. 
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Radar  measurements  of  rain  generally  show  a marked  horizontal  inhomogeneity.  Although  propagation  specialists 
and  climatologists  often  consider  two  rain  types,  stratiform  and  convective  (or  showery),  measurements  with  high- 
resolution  radars  generally  show  that  both  rain  types  are  inhomogeneous  and  quite  showery. 
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Figure  8.-  Weather  radar  map  for  New  England  showers. 
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Figure  9.-  Estimated  and  measured  antenna  temperature 
relative  to  a 2°  elevation  angle  clear  sky  value. 


The  showery  structure  of  rain  is  also  evident  m 
radiometric  measurement:  of  rainfall  made  using  the 
Haystack  radiometer  system  described  in  table  3 . Radio- 
metric  observations  'ere  made  by  scanning  the  antenna 
across  rain  showers  of  interest.  The  radiometric  measure 
mer.ts  made  simultaneously  with  the  radar  measurements 
presented  in  figure  8 are  presented  in  figure  9.  The  points 
labeled  as  measured  and  as  averaged  measured  tempera- 
ture are  measured  changes  in  antenna  temperature  from 
the  clear  sky  value  for  an  elevation  angle  of  2.0°.  The 
solid  lines  enclose  the  ±1  standard  deviation  estimate 
of  antenna  temperature  change  based  on  computations 
using  an  approximate  solution  to  the  radiative  transfer 
equation  and  estimates  of  attenuation  cross  section  per 
unit  volume  based  on  the  radar  measurements.  The 
relationship  between  aUenuation  cross  section  and 
scattering  cross  section  was  derived  statistically  from  a 
large  number  of  computations  using  measured  drop-size 
distributions  [Crane,  1968] . The  data  presented  in 
figure  9 are  the  direct  measurements  of  antenna  tempera- 
ture changes  - not  changes  in  sky  or  brightness  tempera- 
ture. The  true  value  of  sky  temperature  change  are  higher 
and  may  be  determined  by  dividing  the  listed  temperature 
values  by  the  integrated  antenna  efficiency,  which  is  0.6 
for  the  elevation  angles  used.  The  measurements  show 
reasonable  agreement  between  measurements  and 
estimates  based  on  radar  data. 

Bistatic  measurements  of  rain  shower  structure  were 
made  using  the  scatter  system  described  above.  The  ob- 
jective of  the  measurement  program  was  to  measure  the 
scattering  angle  dependence  of  natural  rain.  The  measure- 
ments also  depict  the  showery'  nature  of  rain.  Figure 
10  shows  a radar  measurement  similar  to  that  in  figure 
8.  The  simultaneously  made  bistatic  scatter  observations 
are  shown  in  figure  1 1 . Note  the  rapid  spatial  change 
of  rain  intensity.  The  cells  evident  in  figure  10  are 
marked  by  arrows  in  figure  1 1 . The  horizontal  lines 
give  the  estimated  transmission  loss  based  on  the  several 
scattering  models  listed  and  the  use  of  the  bistatic 
radar  equation.  The  only  agreement  shown  here  is  at 
an  azimuth  3ngle  of  215°.  At  the  other  azimuth  angles, 
obstacles  existed  between  the  transmitting  antenna 
and  the  scatter  volume  and  agreement  was  not  obtained. 

In  a recent  set  of  measurements  (Chine,  1971  ],  a 


large  number  of  comparisons  were  made  with  no  intervening  obstacles.  Tie  results  of  the  latter  set  of  measurements 
show  agreement  with  the  Rayleigh  model  within  the  measurement  accuracies  of  the  radar  and  scatter  systems. 
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Figure  10.-  Weather  radar  map. 


Figure  11.-  Transmuiion  loss  measurements  vs  receive 
antenna  azimuth,  3.0°  receive  elevation  angle. 


SUMMARY 


A series  of  measurements  of  the  structure  of  two  atmospheric  parameters  detectable  by  radar  techniques  were 
presented.  The  measurements  were  obtained  as  part  of  a program  of  microwave  propagation  research  which  has  as  its 
object  the  testing  of  models  used  to  predict  propagation  phenomena.  The  measurements  show  the  inhomogeneity 
typical  of  atmospheric  parameters.  Experience  obtained  in  the  course  of  the  research  program  has  indicated  that  as 
radars  are  developed  that  have  higher  resolution  volumes,  measurements  of  atmospheric  phenomena  will  show  significant 
changes  in  the  intensity  of  the  phenomena  over  the  smallest  distances  resolvable. 
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DISCUSSION 


Chang  Are  you  familiar  with  Tatarski’s  1967  book  in  which  he  shows  Cn  is  a function  of  temperature  ard 
ind  velocity  and  also  extends  the  equation  over  a wide  range? 

Of ’ll  ■ 1 have  net  seen  the  translation  but  there  is  not  a big  change  from  his  earlier  work.  Some  people,  in  fact, 

doubt  his  extension.  Otner  recent  Russian  work  is  even  more  important  but  difficult  to  apply.  They  show  that 
details  within  a volume  tend  to  govern  Cn  and  we  can't  get  this  from  profile  measurements.  All  we  can  find  is 
the  region  to  expect  a layer  but  not  its  stiength. 

Unidentified  speaker:  How  did  you  obtain  estimates  of  rain  drop-size  distributions  and  how  critical  are  they? 

Crant  From  a wide  number  of  sources,  pi -dominantly  camera  surveys  of  the  Illinois  State  Water  Survey.  New 
AFCRL  measurements  improve  on  these.  We  can  classify  rain  by  type  and  get  better  estimates  of 
attenuation  given  reflectivity  and  rain  rate.  Our  accuracy  is  about  30%  standard  error  of  estimate  in 
attenuation  cross  section  per  unit  volume  due  to  drop-size  distribution  affects.  The  AFCRL  results  may 
reduce  this  number. 
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ABSTRACT 


A ground-based  vertically  pointing  FM-CW  radar  (frequency  modulated,  continuous  wave)  is  described  that 
permits  remote  probing  of  the  refractive  index  structure  in  the  troposphere.  The  radar  has  the  characteristics  of 
extremely  high  sensitivity,  ultrahigh  range  resolution,  and  close  minimum  detection  range  without  clutter.  The 
sounder  routinely  detects  layer  structures  in  the  lower  troposphere.  These  layers  are  always  associated  with  gradients 
in  the  vertical  refractive  index  profile,  and  are  frequently  very  thin,  approaching  the  resolution  of  the  radar  ( 1 m). 
Very  often  they  are  perturbed  by  wave  motions.  Examples  of  various  wave  patterns  are  presented,  and  an  explanation 
is  given  for  organized  substructures  frequently  superimposed  on  larger  scale  wave  motions. 


INTRODUCTION 


One  important  advantage  of  remotely  probing  the  atmosphere  is  the  capability  for  continuously  monitoring 
the  layer  structure  in  depth.  Pulsed  radar  has  long  been  used  in  several  countries  to  detect  the  height  of  such  layers, 
but  the  lack  of  resolution  has  precluded  the  possibility  of  examining  structural  detail.  Only  thick  diffuse  layers 
were  normally  revealed,  although  these  occasionally  showed  interesting  wave-like  structures.  Two  kinds  of  returns 
were  observed:  diffuse  echoes  from  regions  of  large  gradient  of  refractive  index,  and  “dot  angels”  from  insects  and 
birds.  Since  the  refractive  index  depends  strongly  on  moisture  and  temperature,  radar  was  evidently  an  important 
potential  instrument  for  remotely  probing  the  meteorological  structure  of  the  clear  troposphere,  if  an  improved  radar 
system  could  be  developed.  Such  a radar  has  been  designed  and  built  at  NELC.  It  is  a unique  FM-CW  radar  (frequency 
modulated,  continuous  wave)  that  differs  from  previous  radar  sounders  mainly  in  two  features:  extremely  high  range 
resolution  (1  m)  and  low  ground  clutter.  Its  narrow  beams  are  directed  vertically,  which  permits  us  to  observe  features 
as  low  as  50  m horn  the  ground.  The  sounder  has  been  in  operation  since  December  1968,  and  observations  of  great 
variety  have  been  obtained.  The  radar  observations  are  accompanied  by  both  standard  and  special  meteorological 
observations  at  the  ground  and  by  radiosonde  soundings.  The  radar  sounding  system  and  some  early  observations 
have  been  described  by  Richter  [1969] . 


DESCRIPTION  OF  RADAR 


For  the  study  of  the  fine-scale  refractive  index  structure  in  the  troposphere  we  need  a radar  sounder  that 
combines  the  properties  of  high  range  resolution,  high  sensitivity,  and  close  minimum  range  without  clutter.  The 
requirement  of  high  range  resolution  capability  would  suggest  the  use  of  pulse  compression  techniques.  However, 
this  approach  would  make  it  difficult  to  meet  the  additional  requirement  of  close  minimum  range,  for  no  signal  can 
be  received  during  the  transmission  of  the  uncompressed  pulse  (and  usually  several  pulsewidths  after).  The  reason 
that  the  minimum  range  is  such  an  important  factor  for  the  sounder  is  that  low  level  inversions  frequently  produce 
scatter  layers  in  the  lowest  300  m of  the  atmosphere,  and  these  layers  can  be  studied  most  effectively  by  simultaneous 
meteorological  observations  by  tower  and  tethered  balloon  instruments. 
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An  FM-CW  radar  docs  not  have  the  minimum  range  restriction  and  allows  high  range  resolution.  It  has  the  additional 
advantage  of  a peak-power-to-average-power  ratio  of  unity,  which  is  very  desirable  from  a technological  viewpoint.  It 
also  permits  a change  of  resolution  without  affecting  the  transmitted  energy  (which  is  not  the  case  for  pulse  radars). 

The  frequency  of  the  radar  was  chosen  to  be  around  3 GHz  as  a compromise  between  higher  and  lower 
frequencies,  each  of  which  would  have  been  desirable  for  different  reasons.  Thus,  to  achieve  a narrow  antenna  beam 
with  a small  antenna  diameter,  it  would  have  been  better  to  select  a higher  frequency.  However,  we  must  also  consider 
the  reflectivity-wavelength  dependence  of  the  scattering  volume.  For  specular  surfaces,  the  power  reflection  coefficient 
increases  as  the  thickness  in  wavelengths  of  the  transition  zone  decreases,  and  we  should  thus  want  to  use  the  largest 
possible  wavelength.  On  the  other  hand,  the  preponderance  of  available  evidence  suggests  that  clear-air-scatter  layers 
comprise  inhomogeneiiies  in  refractivity.  producing  a reflectivity  (rj)-wavelength  dependence  of  about  X 1 3 [Atlas 
and  Hardy.  1966:  Hardy  et  al.,  1966;  Hardy  and  Katz.  1969]  Although  this  wavelength  dependence  is  small,  it 
suggests  using  the  shortest  wavelength,  provided  that  it  does  not  approach  the  limiting  microscale  (8^)  ol  the 
refractivity  perturbations.  Atlas  and  Hardy  1 1966] . therefore,  suggest  an  optimum  wavelength  of  about  five  times 
the  limiting  microscale.  In  weakly  turbulent  conditions,  trr.  !*  of  ihe  order  of  1 to  2 cm  rnd  so  a X of  5 to  10  cm 
seemed  optimum. 

By  means  of  theoretical  calculations  and  experimental  data  |Sa.tron  et  al , 1964] . we  decided  that  with  an 
equivalent  pulse  length,  li=lm,  and  antenna  gain  of  35  dB,  a ratio  of  190  dB  between  average  transmitted  and 
received  power  should  be  adequate  to  detect  most  scatter  layers  in  the  lowest  2 km. 

The  linear  frequency  modulation  is  generated  with  a YIG  tuned  transistor  oscillator,  amplified  to  a maximum 
power  of  1 50  W and  radiated  by  the  transmitting  antenna.  The  signals  are  displayed  on  cathode  ray  tubes  in  an  A-scope 
presentation,  or  an  intensity  modulation,  or  in  a combination  of  both  amplitude  and  intensity  modulation  (or  A-l  scope). 
The  data  are  recorded  by  35-mm,  shutterless  movie  cameras  with  film  speeds  ranging  from  1 cm/hr  to  0.5  cm/s ec. 

The  antennas  consist  of  two  parabolic  dishes.  3 m in  diameter,  with  waveguide  feeds  (fig.  I).  For  good  isolation, 
the  antennas  were  located  in  pits.  At  least  60  dB  isolation  is  necessary  to  avoid  saturation  of  the  preamplifier.  The 

insides  of  the  pits  are  lined  with  microwave  absorbers 
to  suppress  reflections.  The  absorbing  screens  are  also 
required  to  suppress  echoes  from  nearby  ground  targets. 
The  antennas  are  steerable  within  ±2.5°  to  optimize 
the  common  volume  for  a given  height. 

The  performance  characteristics  of  the  radar  are 
compiled  in  table  1 . The  basic  components  of  the  radar 
system  allow  operation  in  the  enure  2-  to  4-GHz  range 
The  restriction  to  the  narrower  range,  around  2.9  GHz, 
is  determined  by  the  standing  wave  ratio  of  the  antenna 
feed  and  the  balanced  mixer.  The  variable  range 
resolution  is  an  important  feature  of  the  radar.  In  the 
case  of  volume  scattering,  signal  amplitude  changes 
continuously  with  resolution,  thereby  permitting 
significant  conclusions  to  be  drawn.  The  maximum 
range  resolution  of  about  1 m was  determined  experi- 
mentally. Two  targets  93  cm  apart  (simulated  by 
delay  lines)  were  separable  by  choosing  F * 2 1 9 MHz 
frequency  excursion. 

The  values  for  computing  minimum  detectable  cross  section  and  minimum  detectable  reflectivity  are  based  on 
a transmitted  power  of  100  W and  on  a minimum  detectable  signal  of  -1 50  dBm.  This  minimum  detectable  signal 
was  determined  by  using  a delay  line  with  a delay  time  equivalent  to  that  of  a target  at  a height  of  167  m. 
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TABLE  1 PERFORMANCE  CHARACTERISTICS  OF  THE  NELC  RADAR  SOUNDER 


PARAMETER 

VALUE 

REMARKS 

Power 

150  W max 

Typical  power:  100  W 

Center  frequency 

2.8  to  3.1  GHz 

Frequency  excursion 

Variable 

Linear  modulation 

Range  resolution 

Variable 

Maximum  resolution:  1 m for  200-MHz  frequency 
excursion 

Sweep  duration 

50  msec 

10  sweeps/sec 

Receiver  noise  figure 

5 dB 

Minimum  detectable  signal 

-150  dBm 

Antenna  gain 

35  dB 

Antenna  beamwidth 

2.5  deg 

Isolation  between  antennas 

105  dB 

Minimum  detectable  cross  section  at  1 km 

2 3.7  X 10‘6  cm2 

Minimum  detectable  reflectivity  at  1 km 

24.2  X 10‘,s  cm'1 

For  1-m  range  resolution 

The  sensitivity  of  the  radar  for  a point  target  is  given  by 


°min  = 3.7X10"" *r4 

where  is  the  detectable  cross  section  in  cm1 , and  r is  the  distance  of  the  target  in  kilometers. 

Typical  insects  with  cross  sections  of  10-3  cm2  [Hardy  and  Katz,  1969] , therefore,  can  cause  dot  echos 
24  dB  above  noise  at  1-km  height.  For  distributed  targets  the  minimum  detectable  reflectivity  rjmjn  in  cm-1 
is 


%iin  = 4.2X10 -‘V2//! 


where  r is  the  distance  in  kilometers  and  h is  range  resolution  in  meters.  Typical  values  of  tj  for  a 10-cm  wave- 
length are  10_1 4 - 10_l  4 cm-1  [Hardy  et  al„  1966;  Atlas  and  Hardy,  1966;  Hardy  and  Katz,  1969] . It  can  be  seen  that 
even  for  the  highest  range  resolution  of  1 m,  returns  can  be  expected  from  as  high  as  1 km.  To  detect  the  weaker 
scattering  regions  at  greater  ranges,  we  should  have  to  increase  the  effective  pulse  depth  or  resolution,  provided  of 
course  that  the  scatter  regions  are  thicker  than  the  effective  pulse. 

Even  though  there  is  no  basic  minimum-range  restriction  inherent  in  this  kind  of  radar,  some  practical  limitations 
(which  could  be  overcome,  if  necessary)  reduce  the  sensitivity  of  the  radar  for  targets  closer  than  50  m to  the  antennas. 
Reflections  from  an  adjacent  building  give  permanent  returns  for  the  first  50  m in  height;  therefore,  targets  must  be 
capable  of  providing  stronger  returns  or  must  change  their  position  in  order  to  be  discernible.  Also,  the  separation 
of  the  two  antennas  imposes  a minimum  height  range  for  beam  intersection.  The  antennas  are  separated  by  5 m,  and 
their  beams  intersect  within  50  m above  the  ground. 
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RESULTS 


Figure  2 is  an  example  of  layers  that  are  observed  by  the  radar  system.  The  50-mtn  section  of  an  intensity 
modulated  trace  shows  a complex  vertical  structure  consisting  of  thin,  undulating  scatter  layers. 

A sounding  balloon  carrying  two  radiosonde  transmitters,  which  were  modified  to  give  continuous  last  response 
temperature  and  humidity  profiles,  was  launched  at  the  radar  site.  The  balloon  was  tracked  from  the  ground  to  obtain 
wind  information.  The  vertical  profiles  of  temperature,  relative  humidity,  wind  sreed,  and  wind  direction  arc  plotted 

in  figure  2.  The  height  scale  for  the  balloon  sounding 
and  the  radar  return  is  the  same  The  refractive  index 
>V  = (n-l ) 10®  ard  the  potential  temperature  are 
computed  from  the  direct  measurements 

The  temporal  continuity  of  the  radar  observation 
technique  may  be  used  for  presentations  of  the  kind 
shown  in  figure  3.  which  is  handdrawn  to  achieve  the  time 
compression  necessaiy  for  illustration  purposes  It  shows 
height  and  thickness  of  the  layer  for  a 2-day  time  period 
No  simple  diurnal  behavior  of  layer  structure  has  been 
established  so  far,  even  though  continuous  observations 
for  as  long  as  4 weeks  have  be  ■ studied.  This  may  be 
due  to  superposition  of  other  meteorological  factors  on 
a diurnal  effect.  It  is  also  difficult  in  the  case  of  multipie- 
layer  structures  to  objectively  identify  and  connect  strata 
with  strong  fading  characteristics  that  alternately  appear 
and  disappear. 
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Figure  2.-  Scatter  layers  observed  by  the  radar  and  profiles  of 
meteorological  data  obtained  from  balloon  soundings.  RH, 
relative  humidity;  N * (n-l)10*.  refractive  index;  t, 
temperature. 


An  excellent  example  of  a multiple-layer  structure 
is  presented  in  figure  4.  Several  distinct  strata  of  various 
vertical  thicknesses  and  temporal  behavior  can  be  seen; 
superimposed  is  the  refractive  index  and  potential  tempera- 
ture profiles  obtained  from  a radiosonde  sounding  at  1238  PST.  The  agreement  of  backscatter  zones  and  refractive  index 
gradients  is  remarkable  The  significance  of  this  layer  structure  on  radar  coverage  is  illustrated  in  figure  S,  in  which  a 
transmitter  at  a height  of  100  ft.  is  assumed.  The  earth  is  drawn  flat,  which  gives  a straight  ray  an  upward  bend.  From 

the  ray  trace  picture  we  should  expect  a hole  in  coverage 
at  a range  of  around  25  nautical  miles.  We  also  see  from 
the  ray  trace  picture  that  only  the  layer  belonging  to  the 
steepest  gradient  influences  the  coverage  significantly. 

This  layer  changed  about  80  m in  height  between  1220  and 
1240  PST,  which  could  influence  an  accurate  coverage 
prediction.  This  example  exphasizes  the  need  for  precise 
and  continuous  knowledge  of  the  refractive  index  structure, 
if  accurate  coverage  predictions  are  required.  Many  more 
examples  of  layer  structures  with  radiosonde  sounding 
data  are  presented  by  Richter  and  Gossard  [1970] 

Radar  reflectivity  is  obtained  from  the  recorded 
A-scope  presentation.  Examples  of  quantitative  analyses 
are  given  by  Richter  [1969]  and  Atlas  et  al.  [1970],  In 
general,  the  maximum  reflectivities  measured  with  the 
FM-CW  radar  sounder  are  one  to  two  orders  of  magnitude 
Fifu/e  3.-  Approximate  height  and  thickness  of  layer  structure  higher  than  those  observed  with  radars  having  much 
observed  continuously  from  6-®  January  1969.  coarser  range  resolution. 
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The  radar  backscatter  properties  of  water  droplets 
(and,  therefore,  of  rain)  are  well  known.  What  is  not 
well  known  is  the  fine-scale  structure  of  rainfall,  and  the 
radar  sounder  offers  a powerful  tool  for  studying  it.  This 
possibility  has  not  been  fully  exploited  in  San  Diego  for 
two  reasons.  First,  it  is  not  associated  with  the  purpose 
for  which  the  sounder  was  built;  and,  second,  San  Diego 
has  yearly  average  rainfall  of  only  about  10  in.,  which  is 
not  enough  for  extensive  study.  Figure  6 illustrates  the 
capability  of  the  sounder  with  respect  to  rain.  The  uppei 
portion  of  this  figure  is  the  intensity-modulated  radar 
record  of  a multiple-layer  structure  with  intermittent 
rain  showers  starting  at  1152  PST  and  1 222  PST  The 
echo  intensity  of  the  rain  overexposed  the  intensity- 
modulated  record.  The  A-l  record  of  the  time  interval 
from  1 151-1 158  PST  is  shown  in  the  lower  part  of  the 
figure.  Structural  details  of  the  rain  shower  are  evir*  it. 

The  intensity  ol  the  rain  with  respect  to  time  varies, 
leading  to  dif'.erent  “streaks."  The  slope  of  the  streaks 
gives  the  ra.nfall  velocity.  The  deviations  from  J constant 
slope  mu'.t  be  due  to  wind  shear.  The  gap  in  the  record 
betwee  i 1 1 56  to  1 1 57  PST  stems  from  a baseline  shift, 
which  is  caused  by  a strong  echo  (probably  bird  or  airplane) 
overdriving  the  radar  receiver.  Figure  6 is  one  example  of 
v.hat  can  be  studied  with  the  sounder.  Other  recordings 
exist  in  which  rain  evaporates  before  it  reaches  the  ground. 
The  ice-water  transition  is  another  problem  that  could  be 
studied,  as  the  scatter  properties  of  ice  crystals  are 
different  from  those  of  raindrops. 
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Figure  4.-  M iltiple  layer  structure.  The  dashed  curve  is  the 
refractive  index  profile;  the  solid  curve  the  potential 
temperature  measured  with  a sounding  balloon. 


Perhaps  the  most  exciting  observations  made  with 
radar  sounder  concern  the  detailed  structure  of  wave 
motions  in  the  troposphere  The  presence  of  such  wave* 
has  been  known  ( Gossard  and  Munk,  1954] , and  they 
have  been  recorded  by  other  radars  on  rare  occasions 
when  the  size  scales  and  amplitudes  were  large  enough 
for  their  much  coarser  resolution.  The  present  sounder 
showed  that  the  waves  are  ubiquitous,  and  occur  with 
vertical  amplitudes  of  as  little  as  some  10  m.  It  also 
revealed  details  of  wave  motions  that  had  never  been 
observed  before.  Many  presentations  and  interpretations 
of  wave  motions  observed  with  the  sounder  have  been 
given  [Gossard  el  al. . 1970;  Atlas  etal.,  1970;  Gossard 
and  Richter,  1970] , 

A good  example  of  wave  patterns  is  given  in  figure 
7,  which  shows  f he  same  recording  at  three  different 
enlargements.  Two  diffeient  types  of  wave  motion  are 
dearly  distinguishable:  long-period  (5  to  10  min.  or 
longer)  internal  waves,  and  much  shorter-period  Kelvin- 
Helmholtz  (K-H)  wave  instability  structures  sometimes 
superimposed  on  the  longer-period  waves.  The  enlarge- 
ments in  the  middle  and  lower  part  of  the  figure  reveal 
short-period  and  small-amplitude  K-H  wavei  that  show 


Figure  5.-  Ray  trace  for  the  refractive  index  profile 
and  layer  structure  of  figure  4. 


it  lutr  its* 


l»||  M||  !l»»  Ml*  MM  "M  ••»»  MM 

• at  n»n  - 


Figure  6.-  Structure  of  tain. 
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dear  evidence  ot  breaking.  A perfect  sequence  of  a K-H 
wave  is  shown  in  figure  8.  The  wave  builds  up  until  it 
overturns  and  breaks,  and  the  air  masses  at  the  interface 
become  mixed.  The  lower  part  shows  a section  .if  the 
same  event,  but  in  A-l  p-esentation.  This  kind  of  pre- 
sentation shows  particularly  clearly  the  sharp  boundaries 
and  the  small  vertical  extent  of  the  zones  from  which 
the  electromagnetic  energy  is  relumed 


Cases  in  which  the  radar  illuminate  ; the  wave 
structure  through  considerable  depth  are  of  special 
interest  because  they  can  reveal  details  of  the  region 
where  true  wave  motion  couples  with  the  instability 
structures  at  the  critical  level.  This  requires  radar 
returns  from  multiple  layers  over  considerable  depth. 
Figure  9 shows  layers  above  and  below  the  major  wave- 
train  located  between  400  and  S00  m.  The  major  wave- 
train  perturbs  the  upper  stable  layers,  but  has  no  apparent 
effect  on  the  layers  below,  which  tend  to  consist  of 
independent,  breaking  wavetrains.  The  vertical  structures 
between  100  and  170  m are  caused  by  convection  m the 
marine  layer  below  the  stable  region. 


Figure  7.-  Wive  pittemi. 


In  an  effort  to  examine  the  fine  structure  of  the 
scattering  layers  in  more  detail,  the  radar  record;  were 
recorded  on  magnetic  tape.  Since  the  dynamic  range 
of  the  magnetic  tape  is  much  greater  than  that  of  film, 
various  film  exposures  could  be  run  from  the  tape 
recording,  ar.d  a minimum  exposure  used  for  examina- 
tion of  the  parts  of  the  layers  which  would  normally  be 
overexposed.  When  this  is  done  with  layers  such  as 
those  shown  in  figure  10,  an  interesting  substructure 
is  shown  in  the  lower  frame  of  the  figure.  It  reveals 
the  presence  of  organized  structures  (rather  tlun 
randomly  inhomogeneous  turbulence)  down  to  the 
minimum  size  resolvable  by  the  radar  (about  1.5  m in 
this  case).  This  substructure  bears  an  impressive 
resemblance  to  the  large  instability  features  such  as 
those  shown  in  figure  8,  which  have  been  interpreted 
as  K-H  structures  due  to  shear.  Gonardetal  (1971) 
have  suggested  that  untrapped  internal  waves  propagating 
i.-pward  or  downward  through  temperature  inversion 
layers  can  create  their  own  wind  shear  within  the  layers, 
and  can  also  locally  modify  the  temperature  gradient. 

The  criterion  of  dynamic  stability  (Richardson's  number) 
is  thus  reduced  in  certain  height  regions  within  the  wave 
structure;  and  if  Richardson's  number  becomes  less 
than  0.25,  there  is  theoretical  reason  to  believe  that 


Figure  g.-  Breaking  wive;  upper  part,  intimity  modulation; 
lower  pm.  A-l  presentation. 


dynamic  instability  can  occur.  II'  gravity  wave 
perturbations  occur  at  the  levels  ol  instability  (either 
by  propagating  from  other  levels  or  somehow  locally 
generated),  K-H  instability  should  cause  those  perturba- 
tions moving  with  the  local  wind  speed  to  grow  and 
eventually  break,  creating  the  intense,  small-scale 
turbulent  inhomogeneities  that  are  apparently  “seen" 
by  the  radar.  Ambient  shear  across  the  stable  layer 
is  observed  to  exist  almost  l( XK?  of  the  time  in 
Southeri.  California.  Present  evidence  supports  the 
hypothesis  that  multiple,  thin,  dynamically  unstable 
layers  are  created  by  untrapped  waves  (whose  wave 
vector  is  nearly  vertical)  imbedded  in  a medium  whose 
ambient  shear  is  sufficient  to  locally  reduce  the 
Richardson's  number  below  critical. 


CONCLUSIONS 


The  FM-CW  radar  sounder  is  a novel  tool  for 
the  investigation  of  the  structure  of  the  troposphere. 

It  has  yielded  significant  re.jlts  to  date,  in  particular 
in  the  area  of  no  .r  reflectivity  and  atmospheric  wave 
motions.  The  sounder  permits  continuous  monitor- 
ing of  the  vertical  refractive  index  structure  in  the 
troposphere  with  range  resolution  as  high  as  I m. 

Layer  structures  are  detected  routinely.  As  they  may 
change  their  height  in  short  time  intervals,  a continuous 
knowledge  - not  only  of  their  presence  but  also  of 
their  exact  height  - is  necessary  for  accurate  radio 
and  radar  propagation  predictions.  The  sounder  is  an 
excellent  tool  for  the  study  of  the  structure  of  rain. 

The  sounder  will  continue  to  play  an  important  role 
in  the  field  of  mesoscale  meteorology,  in  which  con- 
ventional sounding  techniques  are  extraordinarily 
difficult. 
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Figure  9 - Wave  structures 


Figure  10.-  Top  frame  shows  gravity  waves  in  which  radar 
return  is  apparently  not  phase  related.  Dottom  frame 
shows  same  record  with  reduced  film  exposure  and  reveals 
kdvin/Hebnholtz  substructure,  which  apparently  accour  It 
for  the  radar  return.  Sloping  straight  lines  are  captive 
balloon  echo. 
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DISCUSSION 


Wright:  In  both  this  and  Dr.  McAllister's  paper  I was  struck  by  the  instability  structure  showing  two  sine  waves 

of  about  the  same  period  but  180°  out  of  phase  - apparently  an  oscillation  between  two  distinct  patterns. 

Richter:  This  has  puzzled  us  for  a long  time.  One  should  not  be  deceived,  however,  and  interpret  thir  as  a spatial 
picture.  We  see  the  wave  motion  Doppler  shifted  by  the  wind  sampled  at  one  point  in  space. 

Unidentified  Speaker:  Do  you  have  independent  wind  measurements?  Do  the  waves  move  with  the  wind,  or  do 

they  have  a separate  propagation  velocity? 

Richter:  They  don’t  necessat  lly  propagate  with  the  wind.  Refer  to  Gossard’s  paper  in  JGR  197C.  W.  do  have 

means  of  deducing  true  spatial  wavelength  with  pressure  recording  measurements  on  the  grout 

Crane:  We  visualize  Kelvin-Helmholtz  instabilities  that  break  down  ami  you  get  turbulence.  We  so  *t  no 

scattering  until  the  breakdown  occurs.  But  you  have  the  wave  well  traced  before  that.  Perhap  seeing 

a different  kind  of  turbulence. 

Fiichter:  We  talk  about  turbulence  at  different  scale  sizes.  The  breaking  vaves  themselves  consist  of  smaller  scale 

turbulence  and,  as  1 showed,  organized  substructures.  There  is  much  more  to  be  learned.  We  can  relate  radar 
reflectivity  to  mean-square  fluctuations  in  refractive  index,  as  done  notably  by  Bean  in  Boulder.  But  we  can 
not  construct  refractive  index  profiles  from  the  measurements  at  this  time. 

Kay:  You  are  working  with  an  FM  radar,  which  results  in  your  integrating  over  50  msec.  All  the  returns  you  see 

within  the  resolution  element  are  contributions  of  temporal  integration.  Therefore,  you  are  working  with 
turbulence  of  lower  scale  than  the  thickness  of  layer. 

Richter:  The  integration  over  50  msec  does  not  determine  our  spatial  resolution  of  1 m.  We  sample  the  spectrum 
of  turbulence  at  half  the  radar  wavelength  (5  cm). 

Kay.  Why  do  you  see  the  beginning  of  turbulence? 

Richter:  Because  you  see  turbulence  at  a much  smaller  scale  which  exists  independent  of  the  larger  scale  turbu- 
lence induced  by  breaking  waves. 

Bojarski:  Therefore  we  should  go  to  higher  resolutions. 

Richter:  That’s  right. 

Oane:  We  don’t  know  what  we  are  seeing.  Is  it  turbulence  or  something  else? 
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ABSTRACT 


IJdar  uses  laser  energy  in  radar  fashion  to  observe  atmospheric  backscattering  as  a function  of  range.  Because 
of  the  short  optical  and  near-optical  wavelengths  used,  very  small  particles  and  even  the  gaseous ' iolecules  cause 
significant  scatte  ig.  This  can  complicate  the  evaluation  of  the  observations  by  introducing  attenuation  along  the 
path  as  a second  unknown  into  the  lidar  equation.  In  man  cases,  however,  the  observations  may  be  interpreted 
directly  on  a qualitative  basis  and  show  the  distribution  of  particulate  matter  in  clear  a;*  or  enable  the  dimensions 
of  visible  cloud  to  be  measured  accurately.  In  other  cases,  particularly  where  additional  data  are  available,  quantita- 
tive solutions  can  provide  useful  information  on  remote  targets  such  as  tenuous  smoke  clouds  01  ..aze  layers  Examples 
of  such  observations  are  given,  illustrating  the  computational  approach  to  the  evaluation  of  the  volume  concentration 
of  natural  dust  and  haze  layers  in  the  lower  atmosphere  and  the  mass  concentration  of  a smoke  plume.  In  both  cases 
lidar  data  are  related  to  independently  obtained  data  on  the  particulate  concentrations  involved. 


INTRODUCTION 


Lidar  is  a technique  that  uses  laser  energy  in  radar  fashion  for  remote  probing  of  the  atmosphere.  It  combines 
the  principles  of  weather  radar  with  older  optical  scattering  concepts  to  provide  an  important  new  facility  for  observing 
particulate  concentrations  and  evaluating  atmospheric  optical  parameters. 

The  technique  has  recently  been  reviewed  by  Collis  [1970] , who  describes  its  application  in  general  meteorology 
and  air  pollution  studies  mainly  in  the  lower  atmosphere,  while  Kent  and  Wright  [1970]  emphasize  its  use  in  upper 
atmospheric  probing.  In  this  paper  the  problem  of  deriving  quantitative  data  from  lidar  observations  is  discussed  and 
examples  are  given  of  successful  quantitative  interpretation  in  two  actual  cases. 


BASIC  LIDAR  CONCEPT 


In  its  basic  form,  lidar  employs  a laser  simply  as  a source  of  pulsed  energy.  Typically  Q-switched  ruby  or  neo- 
dymium doped  glass  systems  are  used  to  generate  pulses,  having  peak  power  of  tens  of  megawatts  and  durations  of 
10  to  20  nsec  that  are  directed  in  highly  collimated  beams  by  suitable  optical  systems.  Energy  backscattered  by  the 
atmosphere  is  detected  by  a photomultiplier  after  collection  by  a lens  or  reflector  s>»icin.  The  resulting  signal  is 
evaluated  as  a function  of  time  from  the  transmission  of  the  pulse,  in  radar  fashion,  and  is  typically  displayed  on  an 
oscilloscope  either  directly,  or  after  storage  in  a magnetic  disc  video  recorder.  Polaroid  or  other  photographs  may 
be  used  to  record  the  displayed  data. 
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Taking  advantage  of  the  monochromatic  nature  of  the  laser  energy . nanow  band  opticc'  filters  may  be  used 
in  the  receiver  system  to  minimize  “noise”  caused  by  extraneous  light.  The  nature  of  the  received  signal  Pr  is 
given  by 


Pr(R)  = PtL0(R)AR~2  exp 


(1) 


where 

Pr  instantaneous  received  power 
Pt  transmitted  power  at  tQ 

L effective  pulse  length  (8);  f=cr/2  where  c is  the  velocity  of  light  and  t is  pulse  duration;  it  is  the  range 
interval  from  which  signals  are  simultaneously  received  at  time  t 

0 the  volume  backscatt  :ing  coefficient  of  the  atmosphere  (sr-1 8-1 ) 

R range  [f?  = C(t  - 10)/2]  where  tj  is  the  time  of  transmission  of  pulse 

a volume  extinction  coefficient  (8  ) 

A effective  receiver  aperture. 

The  magnitude  of  0 and  a depend  on  the  wavelength  of  the  incident  energy,  and  the  number,  size,  shape,  and 
refractive  properties  of  the  illuminated  particles  per  unit  volume.  These  range  from  the  hydrometeors  of  precipitation, 
cloud,  or  fog  through  the  particulate  components  of  the  “clear”  aerosol,  to  the  molecules  of  the  gaseous  atmosphere. 

The  scattering  and  absorption  mechanisms  involved  have  been  widely  studied.  For  the  moment  it  will  sufuce  to  note 
that  at  the  wavelengths  commonly  employed  (for  exan.ple,  0.6943ji  for  ruby  lasers  and  1 .06ju  foi  neodymium  lasers) 
the  energy  is  readily  scattered  by  particulate  material  of  even  very  clear  atmospheres  and  typical  lidar  systems  can 
detect  returns  from  the  clear  gaseous  upper  atmosphere. 

In  relatively  clear  atmospheres  the  extinction  coefficient  is  small  and  variations  in  backscattering  as  a function  of 
range  can  readily  be  interpreted.  For  example,  the  presence  of  a sudden  increase  in  scattering  due  to  a cloud  or  haze 
layer  is  obvious  and  unequivocal.  Similarly  a marked  reduction  in  turbidity  is  apparent  as  a decrease  of  signal  intensity 
at  the  range  of  the  discontinuity.  In  strongly  scattering  conditions,  however,  marked  attenuation  occurs  and  in  observing 
a cloud,  for  example,  the  lidar  return  rapidly  diminishes  as  the  penetration  of  the  cloud  increases.  The  evaluation  of 
lidar  signals  in  such  circumstances  thus  presents  certain  difficulties  and  imposes  the  need  for  care  in  making  qualitative 
inte'preta»iot.s. 


EVALUATION  OF  THE  BASIC  LIDAR  EQUATION 


The  atmospheric  volume  extinction  coefficient  a , itself  largely  due  to  scattering  (although  some  gaseous  and 
particulate  absorption  may  occur),  is  in  gene.al  related  to  the  volume  backscattering  coefficient  0 . Where  the 
scattering  particles  are  small  compared  to  wavelength,  as  is  the  case  with  gaseous  molecules  or  the  smaller  particles 
of  the  natural  aerosol,  Rayleigh  scattering  applies  and  the  relationship  is  consistent  and  0 = (1 .5/4rr)o.  However, 
in  so-called  “Mir  scattering, ” where  particle  size  is  comparable  to  wavelength,  small  changes  in  particle  diameter  cause 
marked  changes  in  backscatter  cross  section.  (For  example,  a change  in  the  diameter  of  a spherical  particle  by  1 /100 
can  cause  a 20  to  1 variation  in  its  backscatter  cross  section.)  Although  substantial  averaging  occurs  over  the  range  of 
particle  sizes  present  in  typical  natural  aerosols  (such  as,  haze,  cloud,  or  fog),  the  relation  between  the  volume 
extinction  coefficient  and  the  volume  backscattering  coefficient  is  by  no  means  consistent,  particularly  with  the 
monochromatic  energy  generated  by  lasers  [Twomcy  and  Ho  , 1965] . 
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This  uncertainty  has  important  implications,  for  the  lidar  equation  can  only  be  solved  for  the  optical  parameters 
p and  a if  they  can  be  related.  Where  the  relationship  remains  constant  = as  a function  of  range,  the  lidar 
signal  can  be  evaluated  by  solving  the  lidar  equation  in  a differential  form,  provided  that  kx  is  known  and  a reference 
value  for  P or  a is  available.  This  condition  applies  to  atmospheric  conditions  in  which  the  particle  size  distribution 
does  not  vary  substantially  and  for  which  values  for  k , can  be  adduced  from  knowledge  of  the  particle  distribution, 
or  deduced  from  other  lidai  measurements  (for  example,  of  calibrated  targets).  Average  values  of  Ar,  may  be  employed 
for  different  types  of  atmospheric  target*  In  cases  where  pa.  tide  size  distributions  are  not  constant,  as  in  the  case  of 
cloud  and  various  conditions  of  mist  and  fog,  a power  iaw  relationship  (dJn^/dCtio  ~ k j ) may  be  employed  to  derive 
at  least  approximate  solutions.  Solutions  of  either  form,  it  should  be  noted  are  very  sensitive  to  errors  in  the  initial 
assumptions  or  to  errors  in  measurement. 

Additional  data  in  other  forms  may  be  useful.  For  example,  the  total  path  extinction  or  transmittance  of  a cloud 
or  haz  * layer  may  be  derived  from  comparison  of  observations  of  the  clear  air  on  the  near  and  far  side  oi  the  layer  The 
additional  information  on  transmittance  may  then  be  used  to  check  and  improve  the  assumptions  made  regarding  the 
PI  a relationship,  as  well  as  to  provide  reference  values.  The  use  of  targets  of  known  reflectivity  may  be  used  in  a some- 
what similar  way  to  calibrate  lidar  performance  or  derive  path  transmittance  for  “visibility”  measurement.  In  the  special 
case  of  an  homogeneous  atmosphere  (P  and  a constant)  the  lidar  equation  may  be  simply  solved  (for  o = 

-d inPfR1  l2dR) . Similarly,  where  the  optical  parameters  vary  only  in  the  vertical,  a solution  possible  from  a series 
of  two  or  more  lidar  observations  made  at  different  angles  of  elevation,  in  either  case,  the  degree  of  homogeneity  of 
the  atmosphere  may  be  apparent  from  the  lidar  observations  themselves. 


EXAMPLES  OF  LIDAR  OBSERVATIONS  OF  ATMOSPHERIC  CONDITIONS 


Observation  and  fcvaluation  of  Particulate  Content  of  Relatively  Clear  Air 

In  this  example,  sequences  of  observations  made  with  a neodymium  lidar  from  an  aii craft  were  used  to  investigate 
the  particulate  content  in  the  atmospheric  layer  below  3 km  over  the  sea  in  connection  wiih  the  Barbados  Oceanographic 
and  Meteorological  Experiment  (BOMEX)in  1969  [Uthe  and  Johnson,  1971] . 

Figure  i a computer-drawn  representation  of  lidar  S values,  These  S values  represent  range-normalized  signal 
returns  in  logarithmic  foim  th it  are  corrected  for  instrumentation  transfer  anomalies  and  any  pulse-to-pulse  lidai  per- 
formance variations.  They  thus  evaluate  in  relative  terms  the  atmospheric-dependent  parameters  of  the  lidar  equation: 

P exp  -2/a  dr.  Assuming  that  attenuation  may  be  neglected,  a i OX 3 km  vertical  cross  section  of  relative  volume 
concentration  of  particulate  material  is  shown  in  absolute  terms  (fig.  1(a))  and  in  the  form  of  anomalous  departures 
from  a best-fit  exponential  distribution  (fig.  1(b)).  Tins  dramatically  emphasizes  the  layer  at  1 .8  km  altitude,  which 
is  interpreted  as  the  stream  of  dust  carried  by  the  NE  trade  winds  to  the  Caribbean  area  from  the  Sahara  desert.  The 
assumption  that  attenuation  is  negligible  in  such  conditions  and  the  relationship  of  the  lidar  data  to  absolute  volume 
concentrations  were  investigated  with  the  help  of  particle-size  distiibutions  from  an  independent  aircraft  sampling 
program.  Using  these  data  and  Mie  theory  expected  lidar  signal  returns  wcie  computed:  these  are  shown  for  three 
occasions  in  figure  2. 

The  dashed  lines  represent  simulated  relative  lidar  returns,  ignoring  the  effect  of  atmospheric  attenuation  of  the 
energy  pulse.  The  corresponding  unbroken  lines  represent  another  set  of  simulated  relative  lidar  returns  obtained  after 
taking  into  account  the  attenuation  of  the  laser  energy.  A lidar  altitude  of  3 km  is  assumed.  By  comparing  the  dashed 
and  solid  profiles,  which  show  only  about  1 dB  difference  at  the  surface  for  the  two  hazy  days,  it  is  seer,  ibat  the 
neglect  of  atmospheric  attenuation  does  not  significantly  affect  the  return  signal  profile.  Therefore,  analysis  o ' particu- 
late mass  or  volume  concentration  need  not  involve  the  assumptions  required  for  rigid  solution  of  the  lidar  equation. 

The  lidar  system  constant  may  be  inferred  by  comparing  the  computed  lidar  returns  with  actual  lidar  returns. 

The  relative  signals  from  three  lidar  traces  each  recorded  durinr,  the  aerosol  sampling  period  are  shown  in  figure  2 as 


2-33 


Figure  I.-  Digital  presentation  ol  lid*/  observed  aerosol  structure  during  BOMEX  (30  June  1*169.  I0SM056  EDT.  Run  ™-7) 

(Bravo  and  x-ray  are  locations  in  the  experimental  area) 

The  location  of  each  observation  is  indicated  bv  vertical  marks  along  the  top  lines  of  the  section. 

heavy  solid  lines.  The  three  computed  and  three  observed  profiles  were  first  plotted  on  two  separate  graphs  and  the 
horizontal  displacement  between  these  graphs  was  adjusted  for  the  best  overall  fit  of  computed  and  observed  data. 

The  abscissa  values  represent  the  observed  lidar  signals  in  relative  logarithmic  uoits. 

Good  agreement  exists  between  computed  and  observed  relative  lidar  return  signals  in  bo ih  day-to-day  and  altitude 
variations.  This  result  indicates  that  the  lidar  characteristics  imoortant  to  the  absoiut-  aerosol  densities  could  be  infers 
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from  the  lidar  backscalter  signatures,  subject  to  errors 
due  to  nonlinear  variations  between  particulate  density 
and  the  volume  backscatter  coefficient.  Figure  3 
presents  a plot  of  the  particulate  volume  concentration 
as  a function  lidar  return  signal.  This  graph  is 
derived  from  Mie  theory,  with  the  abscissa  determined 
from  the  best  fit  between  lidar  and  particle  count  data. 
The  scatter  of  the  data  points  is  inherent  from  Mie  theory 
and  is  a result  of  insufficient  averaging  of  irregularities 
between  particle  size  and  the  particle  backscatter  cross 
sections  [Twomey  and  Howell,  1965).  Nevertheless, 
these  investigations  show  that  useful  quantitat’ve  infor- 
mation on  the  particulate  content  of  the  “clear”  air 
can  readily  be  derived  from  lidar  observations,  and, 
given  the  density  of  the  particulate  phase,  the  aerosol 
mass  concentration  may  be  inferred. 


Measurement  of  Mass  Concentration  of  a 
Smoke  Piume 

In  this  example  the  technique  was  used  to  infer 
the  total  particulate  mass  per  unit  length  of  a plume 
from  the  smoke  stack  of  a large  power  plant  [Johnson 
and  Uthe,  1971] , illustrating  the  evaluation  of  ruby 
lidar  observations  when  it  is  necessary  to  account  for 
attenuation  of  the  laser  energy  within  the  target . 

The  problem  may  be  approached  by  solving  a 
differential  equation  in  terms  of  S values  derived  from 
the  lidar  equation  (1).  Thus 


& - 4.34  -Lii 
d R 0 d R 


8.7  a 


(2) 


where  Bouquer’s  law  of  attenuation,  relating  a to 
transmittance  T, 


R 

o(i)  d2 


(3) 


has  been  assumed  and  o(fi)  is  the  volume  extinction 
coefficient  at  a distance  C along  the  path.  The 
solution  of  Eq.  (2)  for  particulate  mass  concentration 
along  the  laser  beam  path  requires  relating  the  optical 
parameters  o and  0 to  the  particulate  mass  concen- 
tration. For  this  purpose  the  ratio  of  each  optical 
parameter  to  the  particulate  mass  concentration  M 
(mass/volume)  is  defined  as 


(4) 
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Figure  2.-  Derivation  of  lidar  calibration  from  particle  count  data. 
Actual  lidar  profiles  are  shown  as  heavy  lines  while  backscatter 
profiles  evaluated  from  particle  count  data  are  shown  as  less 
heavy  lines  extending  to  above  3.0  km. 
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Figure  3.-  Lidar  return  signal  and  volume  backscatter 
coefficient  related  to  particulate  volume  concentration. 
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Substitution  of  Eq.  (4)  into  the  lidar  differentia!  equation  (Eq.  t2'»)  insults  in  an  expression  relating  mass  concen- 
tration to  the  lidar  "''rrr.zlized  signal  return. 


d M 
d R 


— — M - 2 (eM2  = 0 
4.34  d R € 


(5) 


where  fy,  has  been  considered  not  to  vary  with  range.  The  validity  of  this  assumption  will  be  discussed  later.  This 
nonlinear  equation  may  be  solved  by  first  employing  the  linearization  transform  rj  = M~l  with  the  result 


i]|^'  - 2£*  /^expfC.SWJdsj 


(6) 


where  C\  ~ 1/4.34.  This  solution  principally  assumes  that  1)  £e  and  are  invariant  with  range  and  the  vale 
of  %e  is  known  or  can  be  computed,  and  2)  a boundary  or  clear-air  particulate  mass  concentration  Ma  is  known 
or  can  be  estimated. 


Microscopic  analysis  of  the  fly  ash  effluent  from  the  smoke  stack  in  question  clearly  showed  that  the  particles 
were  mostly  glass  spheres,  apparently  formed  from  the  mineral  content  of  the  coal  by  the  intense  heat  of  the  com- 
bustion process;  hence,  the  use  of  Mie  theory  in  deriving  estimates  of  £e  and  is  justified.  To  a high  degree  of 

accuracy,  the  volume  extinction  coefficient  at  the  lidar  wavelength  of  1 .06/ u may  be  considered  to  be  essentially 
dependent  only  on  the  atmospheric  particulate  matter,  and  the  £ values  may  be  expressed  as: 


L 

!. 
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jjra3  pn(a)da 
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f ™*Qb(x, 
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jtra3pn{a)ua 


where 

a particle  radius 

Qe  extinction  efficiency  factor 

Qb  backscatter  efficiency  factor 

x size  parameter  * 2 rra/X 

m refractive  index  of  particles 

X wavelength  of  the  User  energy 

n number  of  particles  per  unit  volume  unit  radius  interval 
p density  of  the  particulate  matter 


(7) 


(8) 
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To  be  precise,  the  parameters  if  and  {j,  defined  by  Eqs.  (7)  and  (8)  are  exactly  equivalent  with  those  defined  by 
Eq.  (4)  only  when  multiple  scattering  effects  are  absent.  This  is  a good  assumption  because  of  the  narrow  field  of  view 
of  the  lidar  system  (<  1 mrad). 

The  efficiency  factors  may  be  computed  from  Mie  theory  for  a given  particulate  matter  refractive  index,  the  real 
part  of  which  in  this  esse  was  estimated  to  have  a value  of  approximately  1 .5,  based  on  the  known  mineral  composition 
of  coal.  The  requirement  that  ie  and  fy,  be  independent  of  range  is  satisfied  when  the  relative  particle-size  distribu- 
tion is  invariant  with  range;  that  is,  when 


n(R.a)  = C(R)nr(‘ i) 


(9) 


where  C(R)  is  the  number  concentration  and  nr(a)  is  the  relative  size  distribution.  Also,  with  this  assumption  only 
the  relative  particle  size  distribution  is  required  to  evaluate  the  { values.  However,  nr(a)  is  rarely  completely  indepen- 
dent of  range,  and  the  question  may  be  asked  as  to  how  small  changes  in  the  relative  size  distribution  are  reflected  in 
the  ( values.  Since  the  ? values  are  expressed  in  terms  of  integrals  over  the  range  of  particle  sizes,  monodisperse 
(single-size)  particles  represent  the  worst  case  in  terms  of  ( variations  with  changes  in  the  particle  size.  Mie  computations 
of  i e and  are  presented  in  figure  4 for  the  case  of  monodisperse  particles  consisting  of  material  with  a density  of 
2.1  g/cm3  (as  measured  for  the  fly  ash  in  question)  and  a refractive  index  of  m=l  .5— *0  for  incident  radiation  of  wave- 
length 1 .06^1. 


As  shown,  slight  variations  in  particle  size  :an  cause 
large  variations  in  if,  and  smaller  but  significant  variations 
in  . A real  aerosol,  however,  contains  particles  of  many 
sizes,  so  the  variations  in  the  { curves  tend  to  be  smoothed 
out  when  a weighted  integral  for  all  sizes  present  is  com- 
puted. Relative  particle-size  distribution  data  independently 
obtained  by  sampling  from  an  aircraft  were  made  available 
to  us.  along  with  information  about  sampling  volumes  and 
total  particle  counts.  Some  of  these  sets  of  data  were  re- 
duced to  absolute  size  distributions,  to  which  a Junge-type 
model  was  fitted  (that  is,  of  the  form  n = where 

a0  and  y are  fitted  constants,  and  n and  a are  as  defined 
earlier).  However,  there  is  some  uncertainty  about  the 
accuracy  of  the  sampling  volume  values  used  to  compute  the 
absolute  distributions,  since  direct  conversion  of  some  of  the 
dear-air  measurements  to  mass  concentrations  gave  unrealis- 
tically high  values  (M0  > 1000  ug/m3)  of  concentration  u; 
the  clear  air  upwind  of  the  suck.  Since  the  relative  size 
distributions  are  sufficient  for  use  with  the  lidar  daU  and 
also  are  felt  to  be  more  accurate,  they  have  been  used  ex- 
clusively  in  our  analyses.  The  evaluation  of  the  solution 
parameters  ^ and  if,  from  these  particulate  data  (24 
samples  taken  on  several  days  at  various  times  and  at  various 
positions  within  the  plume)  gave  values  ranging  from  0.2  to 
1.0  mJ/g  and  from  0.014  to  0.064  m3  / g*sr,  respectively. 

The  corresponding  overall  means  were  OiS  mJ/g  and  0.039  m1  / g’sr. 
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Figure  4.-  Optical  cross  sections  per  unit  mass  as  a function  of 
particle  size  computed  from  Mie  theory  for  fly  ash. 


The  lidar  observations  and  particle  sampling  of  the  plume  were  best  coordinated  for  cross  section  64.  one  of  the 
series  of  vertical  sections  made  through  the  plume  by  scanning  the  lidar  in  elevation.  This  cross  section  was  oriented 
approximately  cross  wind,  and  this  case  was  chosen  to  apply  the  mass  concentration  solution  discussed  previously. 

The  boundary  parameter  M0  was  taken  from  an  estimate  of  visibility  (7  to  10  miles),  which  gives  M0  * 100  ug/m3 , 
using  the  relationship  of  Noll  etal.  [1968],  Computation  of  if  with  the  aid  of  the  particle-size  measurements  gave 
a value  of  0.25  m3/g. 
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Figure  5.  - Lidar  obiemtioni  of  plume  crow  section  64 
in  relative  logarithmic  unit*. 

(a)  Without  attenuation  correction 

(b)  With  attenuation  correction 

(c)  Attenuation  correction 


With  this  information,  the  lidar  S function  cross 
section  (fig.  5(a))  was  converted  by  using  Eq.  (6)  to 
an  attenuation-corrected  cross  section  of  absolute  mass 
concentration  (fig.  5(b)).  Figure  5(c)  represents  the 
results  of  the  graphical  subtraction  of  (b)  from  (a),  and 
thus  shows  contours  of  mass  concentration  (in  logarithmic 
units)  added  to  the  further  side  of  the  plume  by  the  atten- 
uation correction.  It  may  be  noted  that  the  maximum 
correction  is  9 dB,  which  means  that  the  mass  concentra- 
tion in  that  region  of  the  uncorrected  cross  section  in 
figure  5(a)  was  almost  10  times  too  low.  Although  the 
attenuation  correction  does  change  concentrations 
significantly  in  one  part  of  the  cross  section,  the  general 
shape  and  structure  of  the  plume  remain  the  same. 

A planimeter  was  used  to  integrate  the  total  mass 
per  unit  plume  length  represented  by  ihe  cross  sections 
in  figure  5(a)  and  5(b).  The  results  were  as  follows: 
uncorrected  cross  section  5(a),  350  g/m;  corrected  5(b), 
680  g/m.  The  attenuation  correction  thus  was  significant 
in  terms  of  the  mass  content.  These  values  were  compared 
with  an  estimate  of  the  particulate  emission  rate  from 
the  stack.  Using  observed  data  on  plant  operation  and 
standard  conversion  techniques  gave  a plume  lengthwise 
mass  content  of  875  g/m,  compared  with  the  680  g/m 
measured  by  the  lidar  after  allowance  for  attenuation. 
Although  further  evidence  will  be  necessary  from  other 
comparisons,  the  close  agreement  of  this  initial  example 
does  indicate  that  the  lidar  technique  may  be  capable  of 
useful  quantitative  measurements  of  mass  concentration. 

CONCLUSIONS  AND  OUTLOOK 
FOR  THE  FUTURE 


The  lidar  technique  is  a unique  method  of  observing 

atmospheric  structure  and  opens  up  new  ways  of  deriving  intelligence  of  atmospheric  conditions  and  processes.  This 
intelligence  can  be  a vital  input  into  many  operational  activities  in  meteorology  and  atmospheric  physics  such  as  aviation 
and  air  pollution,  to  name  but  two  obviously  important  areas.  In  research,  the  value  of  lidar  is  no  less  great.  Even  with 
the  relatively  modest  equipment  used  to  date,  lidar  has  provided  a new  insight  into  a number  of  atmospheric  phenomena 
and  revealed  shortcomings  in  our  current  understanding  of  certain  processes  (for  example,  in  turbulent  diffusion  from 
high  smoke  stacks). 

In  certain  applications  its  role  will  be  to  make  quantitative  observations  of  atmospheric  characteristics.  In  some 
cases,  huwever,  it  may  well  be  that  the  difficulties  of  reducing  and  interpreting  the  observation  in  absolute  terms  will 
limit  the  use  of  lidar  to  semiqualitative  observations.  The  remote  detection  and  mapping  of  structural  features  of  the 
atmosphere  by  reference  to  relative  variations  in  its  optical  characteristics  is  a most  valuable  capability,  however,  with 
obvious  utility  in  many  areas  of  atmospheric  research  and  operational  meteorology. 
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DISCUSSION 


Unidentified  Speaker:  Have  you  evaluated  the  role  of  multiple  scattering  in  your  signal? 

Uthe:  Yes.  Multiple  scattering  normally  plays  very  little  role  in  lidar  scattering  because  we  are  usmg  milliradian 

beamwidths  and  receiver  field-of-views.  The  only  important  multiple-scattering  effect  is  the  energy  scattered 
back  into  the  forwai  d directed  beam,  and  this  can  be  accounted  for  by  using  the  aerosal  forward-scattering 
coefficient. 


ACTIVE  ATMOSPHERIC  SOUNDING 
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3.  RADIO-OCCULTATION  MEASUREMENTS 


This  chapter  which  treats  both  active  atmospheric  and  ionospheric  sounding,  forms  a bridge  between  Chapters 
2 and  4.  This  science  is  concerned  with  ground  based  observations  of  coherent  radio  transmissions  as  they  traverse, 
and  thus  are  dispersed  by,  tne  atmosphere  and  ionosphere  of  a planet.  Information  concerning  these  gases  is  obtained 
just  before  and  just  after  the  satellite-borne  transmitter-earth-based  receiver  transmission  path  is  occulted  by  the 
planet  -that  is,  near  immersion  and  emersion  of  the  satellite. 

The  radio-occultation  data  are  inverted  to  yield  profiles  of  the  refractive  index,  which  with  appropriate  assump- 
tions yield  profiles  of  temperature,  pressure,  density,  and  electron  density.  Inversion  algorithms  have  been  developed 
specifically  for  this  problem;  inversion  algorithms  developed  by  seismologists  are  also  being  used.  These  methods 
are  described  and  compared  in  this  chapter  and  in  parts  of  Chapter  7.  The  subject  of  stellar  (noncoherent)  occulta tion, 
which  has  been  used  by  astronomers  to  infer  the  presence  and  character  of  rlanetary  and  satellite  atmospheres,  is  not 
specifically  discussed  in  this  volume  but  is  obviously  related  to  the  presr-.i  subject. 

M.  D.  Grossi  organized  and  chaired  the  session  devoted  to  radio-occultation  measurements. 
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The  methods  of  Abel  integral  transform  and  ray-tracmg  inversion  have  been  applied  to  data  received  from 
radio  occultation  experiments  as  a means  of  obtaining  refractive  index  profiles  of  the  ionospheres  and  atmospheres 
of  Mars  and  Venus.  In  the  case  of  Mars,  certain  simplifications  are  introduced  by  the  assumption  of  small  refractive 
bending  in  the  atmosphere.  General  inversion  methods,  independent  of  the  thin  atmosphere  approximation,  have 
been  used  to  invert  the  data  obtained  from  the  radio  occultation  of  Mariner  5 by  Venus;  similar  methods  will  be 
used  to  analyze  data  obtained  from  Jupiter  with  Pioneers  F and  G,  as  well  as  from  the  other  outer  planets  in  the 
Outer  Planet  Grand  Tour  missions 


INTRODUCTION 


Since  1965  the  method  of  radio  occultation  has  been  used  to  study  the  atmospheres  of  Mars  and  Venus.  The 
Mariner  4 spacecraft  was  used  to  obtain  information  on  the  atmosphere  of  Mars  in  1965  [Kliore  °t  al.  ,1967;  Kliore 
et  al. , 1968;  Fjeldbn  and  Eshleman , 1968] . In  1967,  the  Mariner  5 spacecraft  flew  past  Venus  and  performed  both 
an  uplink  dual-frequency  and  a downlink  S band  occultation  experiment  [Kliore  et  al , 1967 ; Mariner  Stanford  Group, 
1967;  Fjeldbo  and  Eshleman,  1969;  Klioreet  al. , 1 969;  Fjeldbo  et  al. , 1971].  In  1969,  two  more  Mariner  spacecraft  flew 
by  Mars  and  again  S band  downlink  radio  occultation  experiments  were  performed  [Kliore  etal.,  1969;  1970a,i>; 

Fjeldbo  et  cl.,  1970] . Similar  experiments  will  be  carried  out  with  the  Mariner  Mars  1971  Orbiter,  the  Pioneer  F & G 
fly  by  missions  to  Jupiter,  the  Mariner  Venus-Mercury  tly  by,  and  probably  all  future  Planetary  missions,  iicluding 
the  Outer  Planet  Missions. 

The  analysis  and  interpretation  of  the  data  collected  in  the  radio  occultation  experiments  were  carried  out  on 
the  basis  of  theoretical  foundations  established  earlier  [Fjeldbo,  1964;  Fjeldbo  and  Eshleman,  1965;  Fjeldbo  etal, 

1965  ; Kliore  et  al , 1965] . The  analysis  primarily  involved  the  applications  of  rvmi^l  rav-tracing  methods  under  the 
assumption  of  spherically  symmetrical  refractive  index  distributions.  In  the  case  of  Mars,  an  additional  simplification 
accrued  from  the  very  small  refractive  bending  produced  by  a thin  atmosphere.  The  basic  methods  that  are  presented  here 
have  been  used  in  the  past  to  obtain  index  of  refraction  profiles  from  observed  phase  and  frequency  data.  In  addition, 
some  future  applications  of  these  methods  are  briefly  discussed. 


RADIO  OCCULTATION  OBSERVABLES 


The  quantities  observed  on  earth  during  a radio  occultation  experiment  are  the  frequency  and  amplitude  of  the 
microwave  radio  signal  received  at  a ground  station.  The  amplitude  information  is  important  in  determining  the  exact 
times  cf  loss  and  reacquisition  of  the  signal,  and  it  can  be  used  to  obtain  refractive  index  or  absorption  profiles. 
However,  the  frequency  of  the  signal  forms  the  basic  data  used  to  obtain  the  tadial  distribution  of  refractive  index. 
Since  the  observed  frequency  includes  the  effects  of  spacecraft  motion,  rotation  of  the  earth,  and  so  forth,  it  is 

♦This  paper  represents  the  results  of  one  phase  of  research  carried  out  at  the  Jet  Propulsion  Laboratory,  California 
Institute  of  Technology,  under  NASA  contract  NAS7-100. 
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not  immediately  suitable  for  use  in  analysis.  To  obtain  the  frequency  change  due  only  to  the  effects  of  the  planetary 
atmosphere,  the  predicted  frequency,  based  on  an  estimate  of  the  spacecraft  orbit,  is  subtracted  from  the  observed. 
This  is  the  so-called  “frequency  residual,”  which  is  nothing  more  than  the  time  rate  of  change  of  the  total  phase 
change  caused  by  delay  and  refractive  bending  in  a planetary  atmosphere.  Thus,  in  simplest  terms,  the  inversion 
problem  in  the  reduction  of  occultation  data  is  simply  this:  How  does  one  obtain  a profile  of  refractivity  as  a 
function  of  radial  distance  in  the  planetary  atmosphere  from  data  consisting  of  the  time  rate  of  change  of  the 
observed  total  ph  ise  change  as  a function  of  time? 

To  obtain  an  understanding  of  the  relationship  of  the  observed  phase  change  to  the  index  of  refraction,  it  is  of 
interest  to  examine  first  the  relationship  of  the  phase  path  and  refractive  bending  angle  to  a refractive  index  profile. 
Referring  to  figure  1 , the  angle  of  incidence  / is  related  to  the  ray-path  coordinates  r and  <p  by: 


tani  = rd0/dr 


(1) 


From  Bouger’s  rule,  which  applies  to  spherically  stratified  index  of  refraction  distributions,  one  can  express 
the  ray  asymptote  distance  a in  terms  of  the  index  of  refraction  n and  the  angle  of  incidence  i : 


a - nr  sin  i 


from  figure  1 , it  is  aDparent  that 


y-0  * i-ij)  di p * di+d$ 


(3) 


By  differentiating  Eq.  (1),  one  may  obtain  an  expression 
for  the  differential  di  as  follows: 


di 


a [w+(dn/dr)r]  dr 


nr  [(nr)2 -a2] 


TZTTiTT" 


(4) 


Figure  1.-  Riy  pith  geometry. 


Similarly  by  combining  Eqs.  (1)  and  (2),  one  obtains  an  expression  for  the  differential  dtf> 

d0« 

rUnr^-fl2]1'1 

From  Eqs.  (4)  and  (5),  the  expression  for  d\p  can  be  obtained  as 

a(dn/dr)dr 


di// 


(2) 


(5) 


n [(nr)2 -a2] 1/2 

The  total  bending  angle  a is  then  twice  the  integral  of  d^  along  the  ray  path  as  r goes  from  r0  to  infinity 


(6) 


r (dn/dr)dr 
2aJfo  n [(nr)3 -a1] 1,2 


(7) 
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The  phase  path  length  p along  the  ray  path  can  be  drived  as  follows.  A element  of  phase  path  dp  is  given  by 

dp  = c dr  (8) 

where  c is  the  speed  ot  light  in  free  space  and  dr  s the  differential  time  of  propagation  along  the  ray  path  element 
ds  . Thus,  dp  can  be  expressed  as: 

dp  = c~~  - n ds  (9) 


With  reference  to  figure  1 , it  is  obvious  that 


ds  = [dr3+r3d03] 1/3 


(10) 


Combining  Eqs.  (10)  and  (5),  one  obtains 


nr  dr 

di  = [(nr)3-*3]1'3 

Then,  the  total  phase  path  length  along  the  ray  path,  which  is  equivalent  to  the  apparent  distance  traveled  in  free 
space  (taking  into  account,  however,  the  actual  velocity  of  propagation  in  the  medium,  as  well  as  the  curvaiuie  due  to 
refraction),  is  given  by  twice  the  integral  of  dp  from  r0  to  infinity: 


p(a)  = 


n2r  dr 

[(nrf-a7) 1/2 


(12) 


When  the  frequency  residual  observed  during  a radio  occultation  experiment  is  integrated  from  the  time  of  onset 
of  the  atmospheric  phenomena  to  th : time  of  interest,  the  result  is  the  total  observed  phase  change  as  a function  of 
time: 

<l>(r)  = f A/(t)  dr  (13) 

Jto 

The  relationship  of  this  phase  change  to  the  refraction  phenomena  in  a spherically  stratified  atmosphere  can  be 
explained  with  the  help  of  the  diagram  in  figure  2.  The  circular  arc  of  the  radius  rm  represents  the  limit  of  the  sensible 
atmosphere.  The  value  of  rm  is  arhitary,  the  only  restriction  being  that  the  index  of  refraction  for  r>rm  be  equal 
to  unity.  The  phase  path  change  observed  at  the  earth  is  the  difference  between  the  propagation  phase  path  SA’AE 
and  the  direct  path  to  the  earth  SB'  BE.  This  phase  path  change  for  a one-way  propagation  through  the  atmosphere 
can  be  represented  in  terms  of  the  ray  asymptote  distance  a,  the  refractive  bending  angle  a,  and  the  propagation 
phase  path  length  p as  follows: 


<**00  = j [pm(a)  + (/?3-*3)w3(l-coso(j)  -2(rm3-*3),/3  -a  sin  <*(*)]  (14) 

where  R is  the  distance  from  the  spacecraft  to  the  center  of  the  planet,  pm  is 

Pm  00 

and  X is  the  free  space  waveleng'.h,  or  the  ratio  at 


= 2 


rm  n7r  dr 


[(nr)1 -a7] 


05) 


the  speed  of  light  c to  the  frequency  / . 
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INVERSION  PROCEDURE 


VkS 


To  convert  the  phase  observable  as  a function  of  time  into  a quantity  suitable  for  inversion,  the  trajectory,  or 
ephemeris,  of  the  spacecraft  must  be  brought  into  play.  For  each  time  tf  for  which  the  values  of  A/  and  <!>  are 
available,  the  spacecraft  ephemeris  provides  a position  vector  relafi-,e  to  the  receiving  station  on  the  earth,  a position 
vector  relative  to  the  center  of  the  planet,  and  a velocity  vector ; ! live  to  the  center  of  the  planet.  The  component 
of  the  planet-centered  spacecraft  velocity  in  the  direction  of  the  earth,  one  light  propagation  interval  after  the  time 
tj,  is  given  by 

Ve  = V'E  (16) 


where  V is  the  planet-centered  velocity  in  the  plane  containing  the  spacecraft  and  the  centers  of  the  earth  and  the 
planet,  and  E is  the  unit  vector  in  the  direction  of  the  earth  one  light  propagation  time  after  the  time  r,- . The^ 
Doppler  frequency  tliat  one  would  expect  to  see  at  the  earth  if  propagation  were  10  take  place  in  the  direction  E 
is 

Afc-firy  <'7> 

And  the  angle  \ between  the  velocity  vector  V and  the  vector  in  the  direction  of  the  earth,  is  given  by 
(see  Figure  2) 


*£  = 


cos- 


,!3d_ 


(18) 


However,  the  angle  measured  between  the  velocity 
vector  of  the  spacecraft  and  the  actual  direction  of 
the  ray  that  ultimately  arrives  at  the  earth  after 
undergoing  refraction  in  the  planetary  atmosphere, 
is  given  by  [Phimey  and  Anderson , 1968] . 


Figure  2.-  Geometry  for  the  computation  of  phase  change. 


* = 


cos ' [W\ (Af£+Af) 


(19) 


where  A / is  the  observed  frequency  residual.  Ti  e refractive  bending  angle  a is  then  given  by 


a = tg-'li  *20) 

Ai  the  same  time,  it  can  be  seen  from  figure  2 that  the  ray  asymptote  distance  a can  be  obtained  !;y 


a = R sin  (3 


(21) 


where  0 = a + EFT,  and  EFT  = the  angle  subtended  by  the  earth  and  the  center  of  the  planet  at  the  spacecraft. 

Now  one  has  both  the  value  of  the  refractive  bending  angle  a and  the  ray  asymptote  distance  a for  every  input 
data  point  - or  in  effect,  the  refractive  bending  angle  as  a function  of  the  ray  asymptote  distance.  Furthermore,  one 
can  now  relate  the  observed  total  phase  change  4>  to  the  appropriate  value  of  the  ray  asymptote  distance  a , and 
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hence  one  has  $ as  a function  of  a . It  is  a very  simple  matter  to  also  obtrn  the  expression  for  the  phase  path  change 
in  the  atmosphere  pm . From  Eq.  (14),  it  follows  that 


Pm  (a)  = \4>(a)-(/ii-ii:y  n[  1-cos  a(a)]  + l(rm2-a2)1 11  +a  sin  a(a) 


(22) 


Thus,  one  now  has  computed  the  values  of  the  refractive  bending  angle  a and  the  phase  path  p as  functions  of 
the  ray  asymptote  distance  a . 


Thin  Atmosphere  Approximation 

In  a thin  atmosphere,  such  as  that  of  Mars,  the  refractive  bending  angle  is  very  small,  and  in  that  case  Eq.  (19)  can 
be  written  as  follows: 


cos  4/  = cos 


(23) 


But  since  i p = \p£  + a,  and  a is  very  small 


a - - — — A/ 

iW 


(241 


where  VN  is  the  velocity  of  the  spacecraft  in  the  plane  of  the  spacecraft,  earth  and  planet  centers,  and  normal  to  the 
direction  of  the  earth.  Similarly,  the  equation  for  the  ray  asymptote  distance  a becomes 


a - R(  sin  EPT+a  cos  EPT ) 


(25) 


The  expression  for  the  total  phase  change  can  then  be  written  as 


=~j[Pw(<0-2(r*,J-a2  )vi  -fla(s)]+(/l,-«,)l,,~^ 


(26) 


that 


Now  note  that  the  path  of  integration  A'  to  A in  figure  2 becomes  a straight  line  for  a thin  atmosphere,  such 


/rm 

As 

Hence,  Eq.  (26)  can  be  expressed  as 


(27) 


r0 


m *|p,(a )+(«i-j1),/1^ 


(28) 
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where 


Pi(«) 


(n-i)nr  dr 
[(nr)1-*2]*'2 


, , , ,,  a2(a) 


(29) 


Also,  since  in  a thin  atmosphere  the  refractive  index  n is  very  nearly  equal  to  1 , the  quanity  p,  can  be  expressed 
as 


p,(a)~  2X10~* 


N(r)r  dr 
(r2-a2Y /2 


N = (n-l)XlO6 


Now,  introducing  the  transformation  of  variables 


y - rrn~rl 


w * rm2-a 


7 


( 0) 


(31) 


The  integral  in  Eq.  (30)  can  be  expressed  as 


Pt(w)  - 10"4 


#00  dy 
(w-y)in 


(32) 


This  expression  is  now  in  a form  suitable  to  inversion  by  means  of  the  Abel  integral  transform  [Hamel,  1937] , which 
states  that  if 


/(*v>  * K 


g'OQdy 

(w-y)1'2 


(33) 


then 


g(y ) 


1 f(w)  dw 

kit  (y-w)w2  g° 


Applying  the  transform  to  Eq.  (32),  one  obtain*: 


#00 


104  d f?  p,(w)dw 
ir  d y Jo  (y-w)l/1 


(34) 


(35) 
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and 


rN(r)  * 


apt  (a) da 
(a2-r2)‘  a 


Now  performing  tiie  change  of  variables 


a - r cosh  0 

and  performing  the  differentiation  with  respect  to  r,  the  following  result  is  finally  obtained: 


t3b) 


(37) 


AVI 


a[P|(a)  + fl(dp)(fl)/dfl)]  da 
(a2-r2 )'  2 


(38) 


The  expression  in  Eq.  (38)  was  used  to  produce  the  refractivity  profiles  from  the  radio  occuitation  data  obtained  with 
the  Mariner  6 and  Mariner  7 spacecraft  in  1969.  The  original  data,  as  well  as  some  of  the  intermediate  products  of  the 
Mariner  6 data  reduction,  are  shown  in  figures  3, 4.  and  5;  the  resulting  refractivity  profile  in  figure  6.  The  residuai  fre- 
quency change  A/  for  a two-way  passage  through  the  atmosphere  and  ionosphere  of  Mars  is  shown  in  figure  3.  The 
total  phase  change  is  shown  as  a function  of  time  in  figure  4.  Figure  S shows  the  corrected  phase  path  change  in 
the  atmosphere  of  Mars  (in  km)  for  a one-way  passage  as  the  function  of  the  ray  asymptote  distance  a . The  same 
basic  inversion  method  will  be  used  to  analyze  the  data  obtained  from  the  Mariner  Mars  1971  orbiter. 


An  inversion  procedure  making  use  of  the  thin  atmosphere  approximation  and  an  assumption  of  a spherically 
stratified  atmosphere  divided  into  a finite  number  K of  concenttic  layers  was  used  by  Fjeldbo  and  Eshleman  [ 1968] 
to  invert  the  occuitation  data  obtained  from  the  Mariner  4 spacecraft  in  1965.  Referring  to  figure  7,  Eq.  (30)  can  be 
expressed  as  follows 


106p|(m)  = N(m ) Az(r  ,,n)  + 


) N(n  )Az(m,n ) 

n*l 


(39) 


Where  Pi(m)  denotes  the  phase  path  change  fo.  the  mth  ray,  and  N(m)  is  the  refractivity  in  the  mth  layer. 
The  elemei  ,t  A z(m,n)  is  the  length  of  the  portion  of  the  mth  ray  that  lies  in  the  nth  layer  (figure  7).  The  two- 
dimensional  element  A z is  related  to  the  radii  of  the  atmospheric  layers  by 


(40) 
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REFRACTIVITY.  N.  (n  - ltl  ID6  PHASE  CHANGE.  ♦,  cycles 
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Fig'ire  4.-  Total  phase  change,  Mariner  6 entry. 
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£q.  (39)  can  be  repetivitely  solved  for  N , yielding  the  following  procedure: 


N(\) 


106P,(1) 

dzn 


m) 


106Pi(2)-2iV(1)Az(2,1) 

Az(2,2) 


/V(m)  = 


106p,(m)-2  ^ N(n)Az(mji) 


A z(m,m ) 


(41) 


The  procedure  (4 1)  provides  the  refractivity  of  the  mth  layer  as  a function  of  the  radius  of  the  layer;  figure  8 
shows  the  refractivity  profiles  obtained  from  Mariner  4 radio  cccultaticr.  measurement  ( hjeldbo  and  Eshleman, 

1968)  by  means  of  this  method. 

The  simplified  inversion  procedures  making  use  of  the  thin  atmosphere  approximations  are  applicable  to  the 
analysis  of  data  obtained  frnin  Mars.  However,  for  planets  such  as  Venus  and  Jupiter,  which  have  dense  atmospheres, 

and  in  which  refractive  bending  is  quite  considerable 
(approximately  17°  near  the  super-refractive  level  on 
Venus),  more  general  methods  of  inversion  must  be  used. 


THE  mth  RAY 


General  Methods 

It  has  been  shown  how  the  refractive  bending  angle 
a and  the  phase  path  change  p , can  be  obtained  as  functions 
of  the  ray  asymptce  distance  a from  the  observed  fre- 
quency residual  A/  and  the  ephemeris  of  the  spacecraft. 

Both  a and  p, , as  well  as  a number  of  other  related 
quantities,  can  be  inverted  to  obtain  the  refractive  index 
profile  by  means  of  the  Abel  integral  transform.  Let  us 
first  consider  the  inversion  of  the  refractive  bending 
angle  a . 
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Fi|ure  7.-  Cross  section  of  atmosphere  consisting  of  K Figure  8.-  Mariner  * refractivity  profiles 

spherical  layers  (From:  Fjtldbo  tnd  Ethltman , 1968).  [From:  Ffeldbo and Ethleman,  1968 1. 
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In  a method  described  by  Fjeldbo  et  a/.  [1971] , Eq.  (7)  is  first  transformed  by  replacing  the  product  nr  by 
a variable  of  integration  x : 


a(j)  = -2a 


r 


dn  dx 


ndx  (x2-a2),/J 


142) 


Now  letting  a,  denote  the  ray  asymptote  distance  for  a closest  approach  distance  rol  , we  multiply  both  sides 
of  Eq.  (42)  by  the  quantity  (a2-«,2),/2  and  integrate  with  respect  to  a from  a,  to  infinity.  This  yields 


r 


a(a)da 

(7vr‘ 


(43) 


The  term  on  the  left-hand  side  can  be  integrated  by  parts,  resulting  in 
n(r0. ) = exp 


(44) 


which  is  an  expression  for  the  refractive  index  at  the  closest  approach  radius  of  the  ray  r0l  given  by 


•Zl 

rt(fi>l) 


(45) 


The  phase  path  change  ftm  can  be  inverted  in  a similar  manner  [Phinney  and  Anderson,  1968] . Again  letting 
x - nr , Eq.  (1 5)  becomes: 


/'m  XJ  dr 

wU*1 


(46) 


Now,  performing  the  substitution  of  variables: 


y - rm‘-x'  W - rm2  -a1 


(47, 


One  obtains  the  following  expression  for  p . 


Pm(w) = 2 J 
0 


W(rm2-y)  [d(lnr)/dy]  d y 


(w-y) 


1/2 


(48) 


Again  making  use  of  the  Abel  integral  transform  [Eqs.  (33)  and  (34)]  one  has: 
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(49) 


*'(>0  = (rm1~y) 


d(lnr) 

d>- 


d 1 ry  /(") 

I T7T  dw 

d y InJ  (y-w)1'1 
L o 


or.  reverting  to  the  original  variables  x and  a. 

x 2 d(lr.r)  fX  aPm(a)da~l 

d*  = d*[*/m  (a'-x')'''\ 


(50 ; 


It  can  be  shown  \Phinneyand  Anderson , 1968)  that  this  expression  ultimately  yields 


r(jc)  = rm  exp 


a 1 dpM 
cosh  


x a 


da 


(51) 


and  finally,  the  index  of  refraction  as  a function  ot  r (the  closest  approach  distance  of  the  ray)  can  be  obtained 
by 


(52) 


We  liave  seen  that  both  the  refractive  bending  angle  a and  the  phase  path  change  p,  can  be  inverted  to  yield 
the  refractive  index  n as  a function  of  the  closest  approach  distance  of  the  ray  r by  means  of  the  Abel  integral 
transform,  under  the  assumption  of  a spherically  symmetrical  index  of  refraction  profile. 

Ray-tracing  inversion  may  also  be  used  to  obtain  t!..  index  of  refraction  profile  from  data  consisting  of  the 
refractive  bending  angle  as  a function  of  ray  asymptote  distance  [Fjeldbo,  etaL , 1971] . 


Figure  9 shows  the  geometry  for  a ray  that  is  being  re- 
fracted in  an  ionosphere  consisting  of  K concentric  layers. 
If  rn j and  nm  are  the  radius  and  the  index  of  refraction 
respectively,  for  the  mth  layer,  then  the  radius  of  closest 
approach  rom  of  the  mth  ray  is  given  by 

1 

rom  ~ ^ (rm+,,m+l)  (53) 

If  im  i(  is  the  angle  of  incidence  and,  jm  ^ is  the 
angle  of  refraction  at  each  boundary,  then  the  law 
of  sines  and  Snell’s  law  applied  at  the  first  boundary 
gives  the  following  equations  for  the  mth  ray: 

am  = rx  sinim>1  (54) 

sin  im,  i = sin  jm  i (55) 


Figure  9.-  Refraction  of  ray  path  by  concentric  spherical  j 

layers  - index  of  refraction  increasing  with  height  i//m  = — , (5  b) 

1 Frc  r.:  Fjeldbo,  Kliore,  and  Eshleman,  1971).  2 
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where 


am  = ray  asymptote  distance  for  the  with  ray 
a m = total  angle  through  which  the  mth  ray  is  bent  by  the  atmosphere 

4im,  i = the  angle  which  the  mth  ray  must  be  bent  before  it  is  traced  from  the  first  layer  to  the  closest 
approach  point. 

Generalizing  to  the  Jtth  boundary  gives 


'VlsinW-l  = rk sin  im,k 

(57) 

nk-\  sin  im,k  = nk  sin  >m.k 

(58) 

^ m,k  ~ ^m,k-\+  'm,k~im,k 

(59) 

The  data  giving  a as  a function  of  a , and  the  equations  given  above  may  be  used  to  determine  the  refractive 
index  n as  a function  of  r for  each  layer.  For  a ray  passing  above  the  atmosphere,  the  bending  angle  is  zero  and  the 
radius  of  closest  approach  is  equal  to  the  asvmptote  distance.  For  this  layer,  then, 

1 

«o  = l'»  r0  = — (a0+a- ,)  (60) 

where  a0  and  a_1  denote  the  asymptote  distances  for  the  rays  that  pass  immediately  above  the  first  atmospheric 
layer.  Then  from  Eq.  (51)  the  radius  of  the  first  layer  is: 


r i = 2a0-r0 


(61) 


Setting  m equal  to  1 and  ^)>0  equal  to  a,/2  in  the  ray  equations,  we  can  calculate  the  refractive  index  for  the 
first  layer: 


r . /<*!  . tfAi 

n,  = tfi  r,  sinl — + sin  * — II 

L '2  r«/J 


(62) 


Hie  radius  of  closest  approach  for  the  first  ray  r0l  is  given  by 


rom  ~ rm  sin  im,m  , m = \ 


(63) 


Again  applying  Eq.  (53)  with  m = 1 , we  can  determine  the  radius  of  the  second  layer  r2 . Continuing  these  computa- 
tions for  the  remaining  rays,  the  entire  refractive  index  profile  of  the  atmosphere  is  obtained.  For  instance,  the  radius 
rm  and  the  refractive  index  nm  of  the  mth  layer  are  determined  from  the  (m-l)st  and  the  mth  ray,  respectively. 
The  general  expressions  are  of  the  form 


rm  a 2r0m-rm-i 


(64) 
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(65) 


*'  Vjj*v 


nm  ~ nm-\ 


rm-\  sin  im,m-\ 

rm  sin | + sin  1 [(rm-<lrm)  sin 


With  the  introduction  of  additional  mathematical  complexity,  the  ray-tracing  inversion  method  can  be  generalized 
to  include  cases  in  which  the  refractive  index  is  allowed  to  change  in  both  vertical  and  horizontal  directions.  However, 
when  only  a single  occultation  is  observed,  the  horizontal  changes  must  be  specified  before  the  data  can  be  utilized  to 
determine  the  vertical  refractive  index  proftie. 


Figure  10.-  Refractive  bending  angle  versus  the  altitude  at  the  ray 
asymptote  for  Mariner  S entry,  computed  from  423.3  MHz 
amplitude  data  [ From:  Fjeldbo,  Kliore  and  Eshleman,  1971). 
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Figure  1 ' - Refractivity  and  number  density  profile  for  Mariner 
5 er  t y | From:  Fjeldbo,  Kliore  and  Ethleman,  1971). 


Fjeldbo  has  applied  the  ray-tracing  inversion  method 
to  analyze  the  data  from  the  occultation  of  Mariner  5 by 
Venus  [Fjeldbo,  etal.,  1971] . The  refractivity  profile 
resulting  from  the  inversion  of  the  Mariner  5 Venus  night- 
side  data  is  shown  in  figure  1 1 . 

Mathematical  techniques  similar  to  those  outlined 
for  the  inversion  of  radio  occultation  data  from  planets 
having  dense  atmospheres,  already  applied  to  Venus 
occultation  data,  will  be  used  in  the  future  to 
analyze  the  data  obtained  at  Venus  during  the  Venus- 
Mercury  1973  mission,  and  from  Jupiter  during  the 
Pioneer  F and  G missions,  as  wt’l  as  the  Outer  Planet 
Grand  Tour  Missions  which  are  pianned  for  the  latter 
part  of  this  decade  and  the  1980’s. 


AMPLITUDE  DATA 

Amplitude  data  collected  during  a planetary  occulta- 
tion also  contain  other  valuable  information.  If  the  fre- 
quency is  low  enough  that  absorption  effects  can  be 
neglected,  the  amplitude  data  can  be  assumed  to  contain 
information  only  about  refractive  defocusing  and  a pro- 
cedure exists  to  deduce  the  ray  bending  angle  a and  the 
ray  asymptote  parameter  a.  Fjeldbo  et  al.,  [1971]  have 
shown  that  starting  with  rays  that  graze  the  top  of  the 
atmosphere,  the  amplitude  measurements  can  be  used 
point  by  point  to  solve  for  the  differential  change  in  the 
angle  of  refraction  Aa  with  decreasing  ray  asymptote 
altitude.  Integrating  Aa  yields  a as  a function  of  a . 
Figure  10  shows  an  example  of  a refractive  bending  angle 
profile  obtained  in  this  manner  from  the  Mariner  5 Venus 
423.3  MHz  amplitude  data. 

On  the  other  hand,  when  measurements  are  made 
at  a frequency  at  which  absorption  effects  in  a planetary 
atmosphere  are  apparent,  the  amplitude  data  c?n  be  used 
in  z conjunction  with  the  results  of  the  previous  section 
to  compute  a profile  of  the  absorption  coefficient  in  the 
atmosphere. 
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It  can  be  shown  from  the  geometry  of  figure  2 that  the  attenuation  due  to  defocusing  in  the  planetary  atmosphere 
(in  dB)  is  given  by 


■Apia)  = IQ1°8io 


(66) 


where 
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The  refractive  bending  angle  a as  a function  of  a is  available  from  the  techniques  described  in  the  first  section. 
Since  the  observed  total  attenuation  A j is  available  as  a function  of  time,  it  can  be  expressed  as  a function  of  the 
ray  asymptote  distance  a by  comparison  of  the  computed  value  of  a to  the  time  of  receipt  of  the  frequency  data. 
Thus  the  excess  attenuation,  or  loss  Ag,  is  simply  the  difference  between  the  total  observed  attenuation  A j and 
the  computed  defocusing  attenuation  Ajy.  Now,  the  excess  attenuation  due  to  absorption  along  the  ray  path  can 
be  described  as  follows: 


A£(a)  = (10  log, o e)r(a) 


(68) 


or 
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where  o(r)  is  the  absorption  coefficient 

By  means  of  ray-tracing  inversion  discussed  in  the  previous  section,  the  quantity  r for  the  mth  ray  having  a 
closest  approach  radius  of  rom  (fig.  9)  can  be  expressed  as 
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In  the  first  layer 


Ti  = 2Az1)1<7i  +2Azi)q00 
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Finally,  generalizing  to  the  mth  ray, 

m- 1 

Tm-2  £ ^*0* 

om  4=0 (75) 

2 Azm.m 

The  quantities  r0/(,  and  <m,fc+l  are  comPuted  at  each  step  from  Eqs.  (57),  (58),  (59),  (63),  and  (64). 

A similar  method  has  been  used  by  Fjeldbo,  et  al.  [1971]  for  computing  the  absorption  coefficient  in  the  lower 
atmosphere  of  Venus  using  the  Mariner  5 S band  attenuation  data.  It  is  expected  that  an  application  of  similar  methods 
to  the  S band  data  derived  from  the  Pioneer  F and  G Jupiter  missions  will  yield  profiles  of  ammonia  abundance  in  those 
portions  of  the  Jovian  atmosphere  probed  by  the  Pioneer  radio  occultation  experiment. 
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DISCUSSION 


Parker:  If  there  is  significant  absorption  in  the  atmosphere,  the  dispersion  relations  will  imply  phase  distortions. 

Doesn’t  this  affect  your  refractive  index  profile? 

Kliore:  Yes.  It  would  actually  affect  the  temperature  and  pressure  profiles. 

Parker:  If  there  is  significant  absorption,  the  phase  distortion  is  due  to  two  effects:  pure  refraction  and  a 

dispersion  relationship. 

Kliore:  That  is  true.  I don’t  know  the  magnitude  of  the  effect,  but  it  will  show  up  in  the  profile  and  the  profile 

will  be  difficult  to  interpret. 

Bojanki:  If  you  have  absorption  and  phase  data  shouldn’t  you  combine  them  and  pose  the  inverse  problem  for 

a complex  index  of  refraction? 

Kliore:  That  would  be  difficult  since  amplitude  data  include  effects  of  defocusing  and  absorption. 

Reinisch:  When  you  calculated  the  phase  path  change  you  took  the  difference  between  the  refracted  and  vacuum 

paths  forming  the  difference  under  the  integral  sign.  You  should  take  the  difference  between  two  integrals. 

Kliore:  There  is  only  a notation  mistake.  We  do  what  you  suggest. 

Unidentified  Speaker:  You  have  described  two  inversion  methods  - the  Abel  transform  and  the  Shell  method. 

Have  you  compared  these? 

Kliore:  They  are  quite  similar.  Each  data  point  is  treated  as  the  origin  of  every  shell. 

Unidentified  Speaker:  What  is  relative  importance  of  defocusing  and  absorption? 

Kliore:  Venus  is  our  only  source  of  data.  De  ocusing  predominates.  There  is  no  appreciable  absorption  down 

to  where  the  defocusing  distortion  is  about  30  dB.  This  continues  until  the  sum  of  the  two  is  about  45  dB. 
The  maximum  amount  of  absorption  is  about  6 dB. 

Whitten:  We  looked  at  the  effect  of  ionospheric  absorption.  At  your  frequencies,  the  effect  is  negligible. 

Kliore:  We  have  not  specifically  looked  at  that.  Perhaps  it  would  be  important  at  Jupiter. 

Rodgers:  Do  you  have  a well-conditioned  linear  problem,  and  can  you  get  out  as  many  independent  points  as 

you  measure? 

Kliore:  Yes.  The  problem  is  solving  a first-order  linear  differential  equation.  The  discrete  method  is  essentially 

a matrix  inversion.  The  noise  problem  is  manageable. 

Unidentified  Speaker:  In  fact,  can  it  be  written  just  as  the  inversion  of  an  upper  triangular  matrix? 

Kliore:  Yes. 
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ERRORS  INCURRED  IN  PROFILE  RECONSTRUCTION  AND 


METHODS  FOR  INCREASING  INVERSION  ACCURACIES  FOR 
OCOULTATION  TYPE  MEASUREMENTS 

S.  H.  Gross  and  J.  A.  Pirraglia 
Polytechnic  Institute  of  Brooklyn 

N 7 3 ABSTRACT 


The  atmosphere  and  ionsphere  of  a planet  varies  with  latitude,  longitude,  altitude,  and  time.  The  oecultation 
experiment  scans  the  medium  roughly  along  the  locus  of  nearest  points  of  the  ray.  The  present  method  of  measuring 
and  inverting  the  data  is  satisfactory  if  this  locus  is  along  a vertical,  a condition  that  is  closely  approximated  when 
the  motion  of  the  spacecraft  about  the  planet  is  in  or  close  to  the  ecliptic.  When  inclined  to  the  ecliptic  (as  for  example, 
an  orbiter.  to  take  advantage  of  progression  or  retrogression  of  the  orbit),  the  resulting  profiles  are  considerably  in  error 
with  respect  to  the  vertical  profiles  over  the  oecultation  points.  Even  along  the  locus  of  nearest  points,  significant  errors 
occur  about  and  below  the  peak  of  the  ionosphere.  The  resulting  profile  along  the  slant  locus  is  difficult  to  interpret  in 
terms  of  the  physical  thr  ories  of  atmospheres  or  ionospheres.  Proper  interpretation  requires  the  components  of  ‘.he 
gradient  with  radius  and  angles. 

A method  for  augumenting  the  oecultation  experiment  is  described  for  slightly  refractive  media.  This  method, 
which  permits  separation  of  the  components  of  the  gradient  of  refractivity.  appears  applicable  to  most  of  the  planets 
for  a major  portion  of  their  atmospheres  and  ionospheres.  The  analytic  theory  is  given,  ana  the  results  of  numerical 
tests  with  a radially  and  angularly  varying  model  of  an  ionosphere  are  discussed. 


INTRODUCTION 


As  Mariners  4 through  7 moved  behind  the  planets  Venus  and  Mars,  they  provided  information  on  the  verfi-al 
distribution  of  ionization  and  neutral  gases  above  points  of  oecultation  [Fjeldbo  and  Eshleman , 1968;  Kliore  et  al. . 
1965.  1967.  1969],  The  vehicles  moved  in  planes  close  to  the  ecliptic,  and  as  shown  by  Pirraglia  and  Gross  [1970], 
radial  profiles  should  be  expected  for  such  paths. 

Actual  atmospheres  and  ionospheres  vary  with  latitude  and  longitude,  the  time  of  the  planet’s  day.  anti  altitude 
or  radius.  Therefore,  more  complete  comprehension  of  the  physics  and  properties  of  the  media  requires  information 
on  the  variations  with  all  positional  coordinates  and  time.  If  attempts  are  made  to  obtain  such  data  from  spacecrafts 
in  oroits  inclined  to  the  ecliptic,  using  progression  or  retrogression  of  the  orbital  planes  to  advance  or  retard  the  points 
of  oecultation.  then,  as  shown  by  Pirraglia  and  Gross  [1970] , the  presently  constituted  experiment  yields  results 
considerably  in  err.r  if  interpreted  as  vertical  profiles  over  the  oecultation  points.  At  best,  the  results  represent  a 
somewhat  inaccurate  measurement  of  the  refractivity  profile  along  a slanted  locus  of  nearest  points  to  the  planet  of 
rays  between  the  observation  point  on  Earth  and  the  spacecraft.  The  greater  the  inclination  of  the  orbit  and  the 
angular  variation  of  the  medium,  the  greater  is  the  error  The  effects  of  angular  dependence  enter  because  the  radia- 
tion is  scanning  the  medium  in  angle  as  well  as  radius  as  a result  of  the  spacecraft’s  inclined  trajectory. 

Even  if  refrac’ivity  information  along  a slant  path  is  considered  adequate  and  sufficiently  accurate,  it  is  presently 
of  limited  use  because  most  theoretical  work  on  atmospheres  and  ionospheres  has  been  concentrated  on  the  nature 
and  interpretation  of  vertical  profiles.  Adequate  theorv  would  be  needed  for  .omprefiead’ng  and  interpreting  combined 
vertical  and  horizontal  variations.  Difficulties  could  be  encountered  with  special  situations,  s"ch  as  when  (he  slant  locus 
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lies  tangent  to  surfaces  of  constant  refractivity.  Since  the  medium  is  not  stationary  in  time,  these  problems  are  not 
alleviated  by  the  accumulation  of  data  from  numerous  occultations  of  an  orbiter  in  slightly  different  regions. 

The  effect  of  angular  variation  of  refractivity  on  the  occultation  experiment  depends  on  the  ratio  of  the  planet’s 
radius  to  the  scale  height.  The  larger  this  ratio,  the  less  important  is  the  contribution  of  latitudinal  and  longitudinal 
gradients  on  the  measurement.  This  is  because  the  angular  scan  of  the  nearest  point  locus  is  smaller.  Since  the  lower 
neutral  atmosphere  usually  has  a scale  height  that  is  much  smaller  than  ionospheric  scale  heights,  it  is  expected  that 
difficulties  will  be  greater  for  ionospheres.  If  angular  gradients  in  the  neutral  atmosphere  are  greater  in  the  ratio  of 
the  scale  heights  of  the  two  regions  of  the  medium  surrounding  a planet,  then  the  difficulties  will  be  comparable.  Deep 
neutral  atmospheres,  such  as  for  Venus  and  very  likely  for  Jupiter  and  Saturn,  produce  large  refractive  effects  that  cause 
considerable  bending  of  the  rays,  and  may  cause  skewing  and  torsion.  The  locus  of  nearest  points  may  be  considerably 
distorted  and  extend  the  angular  scan  of  the  medium. 

It  i:  desirable  to  measure  the  radial  and  angular  components  of  the  gradients  of  refractivity.  These  components, 
measured  for  each  point  along  the  locus  of  nearest  points,  would  at  least  permit  some  reconstruction  of  the  profile 
along  radial  or  vertical  lines  at  various  places  along  the  locus  A complete  reconstruction  would  require  the  determina- 
tion of  higher  order  derivatives  as  well.  Although  in  principle  these  derivatives  are  measurable,  they  would  be  exceed- 
ingly difficult  to  determine  in  practice.  Thus,  only  first-order  partial  derivatives  may  be  sought,  and  reconstruction  is 
reasonable  only  to  the  extent  that  expansion  to  first-order  terms  is  acceptable.  In  this  sense,  the  medium  would  be 
best  if  it  were  slowly  varying  with  angular  coordinates.  Rapid  changes  with  angle  of  heights  of  separate  distinct  layers 
would  cause  difficulties.  At  worst,  knowledge  of  the  components  of  the  gradients  would  be  useful  by  itself  for  more 
complete  theoretical  treatment  of  the  likely  physical  processes  controlling  the  regions. 

Here  we  present  a method  for  measuring  both  radial  and  angular  gradients  by  means  of  a modified  occultation 
experiment.  The  method  utilizes  simultaneous  scans  of  the  medium  along  slightly  different  loci.  It  has  only  been 
applied  to  slightly  refractive  media.  Some  of  the  ideas  may  be  applicable  to  highly  refractive  regions,  but  such  cases 
require  further  study.  One  must  utilize  dispersion  foi  scanning  on  slightly  different  loci.  For  an  ionosphere,  frequency 
dispersion  is  readily  available.  Foi  neutral  media,  such  dispersion  is  only  possible  at  millimetric  waves  where  the 
experiment  would  suffer  from  absorption.  To  avoid  severely  attenuated  signals  much  lower  frequencies  must  be  used 
where  frequency  dispersive  effects  are  negligible.  Dispersion  must  then  be  spatial,  as  may  be  possible  with  closely  spaced 
radiators  emitting  at  different  frequencies  to  avoid  interference  effects.  It  should  be  noted  that  an  experiment  has  been 
suggested  for  performing  occultations  between  two  spacecaft  rather  than  to  eaith,  which  has  some  merits  in  view  of  the 
difficulties  described  here  [Gross/  eial.,  1969] . 

The  analysis  presents  the  theory  of  the  me  thod  of  measurement  as  applied  to  an  ionosphere  model  simulating 
a medium  wnh  both  angular  and  radial  variation.  Exact  values  for  the  refractivity  are  known  and  are  compared  with 
computed  values  deduced  from  simulated  measurements.  The  results  of  these  calculations  are  presented.  The  angular 
derivatives  transverse  to  the  rays  are  most  easily  obtained.  Angular  derivatives  along  the  ray  are  much  more  difficult 
to  measure,  if  the  locus  of  nearest  points,  from  one  end  to  the  other,  does  not  cover  too  large  a change  in  the  angular 
coordinate  parallel  to  the  rays,  the  determination  of  these  derivatives  is  not  essential.  For  media  that  do  not  extend 
too  far  from  the  planet  (relative  to  the  radius  of  the  planet),  the  major  angular  change  appears  to  be  transverse  to  the 
rays,  a situation  that  may  be  characteristic  for  real  ionospheres. 

THEORY 

T he  residual  Doppler  shift  of  a signal  passing  through  a medium  from  a moving  spacecraft  depends  on  all 
components  of  the  gradient  of  refractivity  of  the  medium.  If  the  contributions  of  the  radial  and  transverse  components 
are  separable,  it  is  possible  to  obtain  approximations  to  these  components  of  refractivity  by  means  of  Abel  transfor- 
mations. The  refractivity  and  its  derivatives,  so  obtained,  are  properties  of  the  medium  at  the  nearests  points. 

When  this  locus  is  close  to  a vertical  to  the  planet’s  surface,  the  profile  obtained  by  direct  inversion  of  the  residual 
Doppler  data  is  a good  approximation  to  the  vertical  profile,  with  the  possible  exception  of  complications  below  un 
ionization  peak,  or  for  regions  of  large  gradients.  Such  direct  inversion  is  the  procedure  based  on  spherical  symmetry 
that  has  been  applied  to  the  Mariner  experiments  [Fjeldbo,  1964;  Fjeldbo  et  al. , 1965  , Fjeldbo  and  Bhlemart,  1965] . 
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When  the  locus  deviates  appreciably  from  a vertical  line,  the  assumption  that  the  derived  data  is  a vertical  profile  is 
incorrect,  as  shown  by  Pirraglia  and  Gross  [1970] , for  media  with  transverse  gradient  components  and  even  for  media 
with  large  radial  components  of  the  refractivity  gradient. 

In  the  occultation  experiment  one  may  view  the  locus  of  nearest  points  as  a line  scanned  by  the  experiment,  the 
scanning  being  accomplished  by  the  motion  of  the  spacecraft  along  its  trajectory.  None  of  the  components  of  the  gradient 
of  refractivity  may  be  separated  from  one  such  line,  unless  it  is  a coordinate  line,  such  as  a .adial  or  a circle  of  constant 
radius.  The  only  derivable  information  is  an  approximate  directional  derivative  along  the  locus. 

One  may  appreciate  in  a qualitative  sense  that  it  is 
possible  to  obtain  information  on  the  components  of  the 
gradient  by  simultaneously  utilizing  more  than  one  scanning 
line  with  small  separations  between  such  lines  These  lines 
as  viewed  from  the  observation  print  appear  to  be  prov  ed 
into  a plane  perpendicular  to  the  line  of  observation  (Fig.  1 ). 
The  spacing  between  corresponding  points  on  these  lines 
can  be  related  to  the  radial  component  of  the  refractivity 
gradient  and  an  angular  component  of  the  gradient,  th 
component  transverse  to  the  radial  component  as  seen  in 
the  projected  plane.  In  terms  of  a cylindrical  coordinate 
system  with  the  nc'ar  axis  along  the  direction  of  observat.-1;, 
the  ir  axis,  and  the  coordinates  of  the  plane  of  projec  tion 
designated  pr  and  dy,  these  components  of  the  refractivity 
would  be  bN/dpf  and  1/pr  bN/bt^.  The  component 
3A^/dzr,  which  is  essentially  parallel  to  the  rays,  appear*  to 
be  poorly  defined  by  the  loci. 

bat  one  may  obtair  the  component  bN/bzr,  but  the 
sensitivity  required  to  accomplish  this  is  far  greater  than  for  the  other  components.  In  principle,  the  more  loci  scanned 
the  greater  the  number  and  order  of  derivatives  that  may  be  obtained.  For  slightly  refrartive  media  it  turns  out  that  zr 
over  the  full  sweep  of  the  loci  barely  changes,  so  that  the  principal  changes  occur  in  the  coordinates  pr  and  4>r  ■ 

The  component  3JV/90r  is  of  great  value  in  reconstructing  a profile  along  a vertical,  as  well  as  in  providing  informa- 
tion on  the  nature  of  the  change  of  the  medium  with  angle.  It  is  of  interest  that  if  the  loci  lie  tangent  to  contours  of  con- 
stant refractivity,  determination  of  bN/bpr  and  bN/b<t>r  will  clarify  this  situation. 

The  denvation  of  analytic  expressions  requires  the  assumption  that  the  medium  has  mall  refract:  , lty  and  that  the 
index  of  refraction  p may  be  expressed  as 

p = 1+eN  (1) 

where  eN  is  the  refractivity  («1).  e is  a small  constant  parameter,  and  N is  the  part  of  the  refractivity  that 
contains  the  dependence  on  position  coordinates  r , 6 , 0.  N may  be  of  the  order  of  unity  in  regions  of  maximum 
medium  density  but  may  decrease  exponentially  as  density  decreases. 

It  is  also  assumed  t^t  the  variation  of  N with  0 and  0 is  small  enough  that  N may  be  expanded  in  a Taylor 
series  about  some  coordinate  angles  00 , 0O , and  in  a restricted  region  about  these  values  the  expansion  may  be  cut 
off  at  the  first-order  derivative  teuns. 

Though  these  assumptions  are  needed  to  permit  the  derivation  of  analytic  expressions,  the  method  is  applicable 
to  a wioer  range  of  parameters  when  treated  by  compute  ional  techniques.  The  Taylor  series  expansion  need  not  be 
cut  off  at  the  first-order  derivatives.  Higher  order  derivatives  may  be  analytically  treated  by  utilizing  additional 
measurements,  but  the  practicality  of  such  measurements  coi<d  pres  nt  severe  equipment  problems. 

If  R(t)  is  the  radius  vector  of  the  spacecraft’s  path,  measured  from  some  convenient  origin,  such  as  the  planet’s 
center,  the  residual  phase  length  of  the  rays  outside  the  region  of  :he  caustic  h*s  a one-to-one  association  with 


Figure  1.-  Nearest  point  loci.  The  loci  of  nearest  points  of  two  sets 
of  rays  are  shown  projected  into  the  plane  perpendicular  to  the 
direction  of  observation. 

This  pictorial  explanation  is  not  exact,  and  it  turns  out 
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the  vector  function  R and  may  be  expressed  as  A<t>  [R(r)] . For  these  conditions  there  is  also  a single-valued  relationship 
between  points  on  the  locus  of  nearest  points  r„  (f),  and  the  path  of  the  spacecraft,  so  that  A4>  = A4>[rn(r)] . The  residual 
phase  length  may  then  be  expressed  either  in  terms  of  the  coordinates  of  the  spacecraft’s  position  or  in  terms  of  the 
coordinates  of  the  nearest  points  of  the  rays. 


The  position  of  the  spacecraft  will  be  specified  in  terms  of  the  cylindrical  coordinate  system  that  has  been 
defined.  The  spacecraft's  coordinates  are  designated  ps , <ps , z,;  4>s  is  measured  from  some  convenient  xr  axis 
such  as  the  ecliptic  north  direction,  assuming  that  zr  axis  lies  in  the  ecliptic  plane.  (One  may  readily  modify 
these  definitions  when  a planet’s  position  out  of  the 
ecliptic  plane  is  taken  into  account.)  This  coordinate 
system  is  considered  fixed  for  the  occultation.  (Changes 
in  the  earth’s  position  are  not  important  for  the  presenta- 
tion here.)  Another  coordinate  system  may  be  defined 
for  the  nearest  point  of  the  ray  emanating  from  or  being 
received  from  the  spacecraft  in  any  one  position  along 
its  trajectory.  This  coordinate  system  is  rotated  relative 
to  the  fixed  system.  Its  polar  or  z-axis,  designated  zn , 
is  parallel  to  the  tangent  to  the  ray  at  the  nearest  point 
and  is  in  the  same  general  direction  as  the  direction  of 
observation.  Its  x-axis,  designated  xn , is  the  radial  from 
the  center  of  the  planet,  the  origin  of  this  coordinate 
system,  through  the  nearest  point.  The  radius  to  the 
nearest  point  is  designated  rn , and  this  axis  is  located 
relative  to  the  fixed  axes  by  an  azimuthal  plane  rotation 
<t>’n  about  the  zr  axis  and  by  a rotation  Q‘n  in  this  plane. 

The  coordinate  systems  and  their  relationship  are  shown 
in  figure  2.  The  coordinates  r„,  0'n,  4>n  for  the  nearest 


point  is  actually  spherical  coordinates  for  the  nearest 
point  with  0 the  complement  of  the  usual  polar  angle. 

The  spacecraft  position  vector  R(r)  is  designated 
by  the  coordinates  ps , <j>5 , z, . In  place  of  time  ; , one 
may  represent  this  position  vector  parametrically  in  terms 
of  one  of  the  coordinates.  Any  convenient  one  may  be 
used,  since  in  an  occultation  it  is  presumed  that  the  osition  of  the  spacecraft  is  known  relative  to  the  planet.  Here 
we  choose  to  use  ps , though  in  a circular  orbit  0,  .nay  be  preferred.  The  choice  is  not  important  for  the  theoretical 
presentation.  Thus,  we  take  p,,  0,  = 0,(p,)  and  zs  * z,(p,)  as  known.  The  nearest  point  position  vector  r n(t) 
may  be  described  in  terms  of  its  radius  rn  and  the  rotation  angles  6'n , 4>'n  relative  to  the  pr , 0r,  zr  coordinate 
system.  Then  differentiation  of  A4>[r„(r)]  * Ad>  [R(r)]  leads  to 


Figure  2.-  Occultation  geometry.  The  coordinate  system  xryrir 
is  the  system  determined  by  the  direction  of  observation.  The 
xn},nzn  coordinate  system  is  determined  by  the  nearest  point 
of  a ray  and  the  direction  of  the  ray  tangent  at  the  nearest 
point.  The  xr»zr  coordinates  are  fixed  during  an  occultation 
while  the  x^y^n  moves  with  the  rays  describing  the 
occultation. 
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The  function  d/dps  (A<1>)  = d,  df  (A<J>)/(dps/df)  is  known,  since  its  numerator  is  obtained  from  the  measured 
residual  Doppler  shift  and  the  denominator  from  the  known  path  of  the  spacecraft.  If  b/brn  (A4»)  d/ (A4>) 
and  d/d0n  (A4>)  are  found  as  functions  of  rn , and  if  suitable  functional  relationships  are  found  between  the 
medium’s  refractivity  and  the  phase  path  functions,  then  bN/brn,  bNlbO'n,  bSIb^  may  be  determined  from 
Abel-type  transformations  as  will  oe  shown. 

The  refractivity  is  now  assumed  to  be  expressed  as  a Taylor  series  about  some  angular  coordinates  60,  <p0, 
where  dQ  is  the  complement  of  the  usual  polar  angle  in  a spherical  coordinate  system.  The  dependence  on  radius 
is  maintained  implicitly  in  all  terms.  The  coordinates  d0,  <t>0  correspond  to  values  in  the  vicinity  of  the  nearest 
point  of  a ray.  Then,  cut  off  at  the  first  derivatives, 


N(r,d'n,<t>'„)  = N(r,eo,<t>0 ) + 


bN(r,6,<t> ) 


bd 
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The  derivatives  of  N from  (4)  are 
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bN(r,Q'n,<t>'n)  bN{r,d,4>) 

bd'„  ~ bd 

d=6o 
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bN(r,e'n,4>'n)  31V(r,g,0) 
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6=60 
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For  media  in  which  the  angular  dependence  of  the  refractivity  it  small  any  individual  ray  with  nearest  point 
coordinates  rn,  d'n,  0 ^ will  be  well  approximated  by  a ray  in  a fictitious  medium  described  by  the  actual  medium 
along  the  radial  line  0'n  and  <t>'n  equal  to  constants.  Then  to  first  order  in  e the  residual  phase  length  may  be 
expressed  by  integrating  along  a straight  line  tangent  to  the  ray  at  the  nearest  point  [Pirraglia  and  G.oss,  1970] : 


(8) 


where  w is  the  frequency,  c is  the  vacuum  velocity  of  light,  ds  is  an  element  of  length  along  the  path  and  s,  the 
distance  along  the  path  measured  from  the  nearest  point,  is  given  by  y/r*  -r„2  . The  factor  2 takes  into  account 
the  near  equal  contribution  from  both  sides  of  the  ray  path  about  the  nearest  point.  The  upper  limit  of  the  integral, 
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which  is  the  value  of  s at  the  sensible  limit  of  the  medium,  is  set  equal  to  infinity,  an  approximation  valid  for 
exponentially  decreasing  media  or  refractivity. 


Taking  derivatives  of  (S)  with  respect  to  and  f(|>  and  using  (S),  (6),  and  (7), 


If  the  left-hand  sides  of  (9),  (10),  and  (1 1)  can  be  found,  then  the  components  of  the  gradient  op  A'  m<iy  be  deter- 
mined by  inversion. 

Defining  the  functions: 
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(3)  may  be  rewritten 


dpt  r,  _,d/t(rM) 

f(Ps)  “ eg(rn)  + e[9„(rK)~e^ 


dr, 
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+ e 


K ('«)'*<?] 


df(rn) 

dr„ 


dd'n  d <(>'„ 


*(rn) 


(16) 


(16)  is  not  sufficient  to  find  ;,/t,  and  e,  additional  equations  being  needed  from  the  ray  path. 

Pirraglia  and  Gross  [1970]  gave  the  first-order  equation  for  the  ray.  With  the  dN/dd'n  term  dropped  as  of 
second  order  in  e,  it  is,  in  Cartesian  coordinates  (xn,  yn,  zn)  (fig.  2). 
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(17) 


where  r0  = zn  + r„xn.  The  vectors  x‘n,‘yn,  and  Ta  are  the  unit  vectors  of  the  coordinate  system. 

For  large  values  of  |2„  |,  where  the  ray  is  outside  the  effective  medium,  the  upper  limit  may  be  set  equal  to 
infinity.  Wi*h  this  limit  (17)  yields  straight-line  equations  representing  the  asymptotic  form  of  the  ray  outside  the 
medium: 


xn  ~ emx (rn)zn  ~ ecx (rn)  + rn 
y„  = emy{rn)zr.  - ecy(r„) 


mx(rn) 


> 


J 


(18) 


(19) 


Here,  mx  and  my  are  the  slopes  for  the  ray  with  nearest  radius  rn  and  position  angles  d‘n,  <p'n  ; cx  and  cy  are 
the  constant  terms  of  the  straight  line  equations  and  may  be  shown  to  be  related  to  m by 
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Operationally  (20)  will  be  represented  by  e = T(m). 

The  ray  path,  projected  onto  the  xnzn  and  ynzn  p'anes  is  shown  in  figure  3.  From  this  figure  and  Eqs. 
( 18)  and  (19),  the  spacecraft  coordinates  may  be  expressed  in  terms  of  the  projection  of  ps  on  the  xnzn  plane, 
narneh  pm,  and  onto  the  ynzn  plane,  y$n  ■ 


?sn  = 2emxzn-eTimx)  + r„ 

(21) 

y’sn  = 2 emyZn-eT(my) 

Due  to  the  small  bending  of  the  rays  the  zn  axis  and  the 
zs  axis  will  be  rotated  from  each  other  by  small  angles  of 
the  order  ot  e,  and  we  may  set  ps  **  pm , and  zs  = zsn , 
the  coordinate  of  the  spacecraft  along  the  zn  axis.  From 
Figure  2 we  also  see  that  y ^ « Ps(4>s-Q, , * Also,  from 
the  definitions  of  mx,niy.g,h , and  e we  have 

mx(rn ) =4  g(rn)  + W'n-eo) 

drn  J 
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myirn)  =T—e(rn) 
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Figure  3.-  Projections  of  ray  paths.  The  upper  figure  shows 
the  projection  of  the  ray  path  into  the  xnzn  plane.  The 
lewer  figure  shows  the  projection  of  the  ray  into  the 
plane. 


so  that,  from  (21)  and  the  approximations  associated  with  the  small  angle  between  the  z axes. 


ps  = f.zsG(rn)  + rn 


Since  bending  in  the  x„zn  plane  equals  emx  to  first  order  in  e, 
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where  in  (23)  and  (24) 


, d/i(r,,)  de(rM) 

G(rn)  s g(r„)  + (0'n-6o)  + (^-0O) 


Equations  (16),  (23),  and  (25)  form  the  desired  set  of  equations  which  can  be  solved  if  frequency  or  spatial 
dispersion  is  utilized  to  provide  the  data.  That  this  is  so  will  be  demonstrated  for  an  ionosphere  with  radiation  at 
two  different  frequencies.  Since  e contains  the  frequency  dependent  quantity,  its  value  will  differ  for  each  frequency. 
For  a given  value  of  r„  for  both  frequencies  the  values  of  ps,  zs,f(ps),  tf>s,  <i>'n.  and  0'n  will  be  different.  When  two 
frequencies  are  radiated  simultaneously,  their  corresponding  values  of  rn  will  be  different,  but  if  the  frequencies 
are  close  enough  or  bending  of  the  rays  is  slight,  rays  from  both  frequencies  will  reach  the  same  rn  at  slightly 
different  times.  We  utilize  the  zs  and  <t>s  from  spacecraft  trajectory  data  as  a function  of  ps,  and  /(ps)  is  obtained 
from  the  measured  Doppler  for  each  frequency  and  the  trajectory  data.  For  a given  rn,  the  corresponding  ps  for 
each  frequency  is  not  known.  To  indicate  that  quantities  are  associated  with  each  of  two  frequencies  we  utilize  the 
subscripts  /,  where  / takes  on  the  values  1 and  2.  Then,  since  the  0'n  terms  contribute  to  order  e‘ , the  equations 
to  first  order  in  e from  (16)  and  (23)  through  (25)  become 


ei-fi(Psi)  = e*G/('/,)  + ei  e<rn) 

dPsi 


Psi  = €izsi  (Psi)  0 + fn 

. - / r,  u e»  L 1 Je(rn) 

<t>si  ~ tni^n)  + \zsi(PsO T H 


rn  2 (r„ 


?i(rn)  = g(r„ ) + ^(r,,)  - 


Foragiven  r„,  with  i = 1,2,  we  have  a set  of  eight  equations  in  the  eight  unknowns  Psi,Pj;,4^ji,0m2,Gi,Gj, 
e.  and  g.  Note  that  to  first  order  in  e,  3<I>  / dd'n,  cannot  be  determined.  By  carrying  out  the  expansion  to  second 
order  m e and  utilizing  three  frequencies,  this  quantity  may,  in  principle,  be  determined. 

The  set  of  equations  (26)  may  be  solved  numerically  for  the  functions  g(rn)  and  e(rn).  From(10),  (13).  and 
(14)  and  r 2 =rni  + s1  v 


r 1 w 

S(rn)  = ar  dr 


e(rn)  - A 


(Mlb<p'n)r  dr 


- l 


These  equations  may  be  inverted  by  Abel  transforms  yielding 
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a n 

—~(r  60,<t>0)  = 

dr 


T~( r'do,<t>0 ) = 
000 


where  the  prime  denotes  differentiation  with  respect  to  the  argument.  We  see  that  dN/dr  and  dN / 90o  from 
(28)  are  the  radial  and  angular  components  of  the  gradient  transverse  to  the  ray,  with  d0  and  <t>(,  taking  on  values 
in  the  vicinity  of  the  nearest  point  of  each  ray. 


NUMERICAL  CALCULATIONS 


Calculations  were  made  for  a model  of  an  ionosphere  distributed  about  a planet  in  which  the  electron  density 
varies  with  radius  and  solar  zenith  angle  and  the  temperature  or  scale  height  varies  with  this  angle.  The  top  of  the 
layer  is  exponentially  decreasing.  Below  there  is  a peak  in  the  ionization  with  a density  that  varies  about  the  planet 
but  at  a fixed  altitude.  The  model  is  described  in  terms  of  a spherical  coordinate  system  (r,  6 , 0)  with  the  polar  axis 
in  the  direction  from  the  planet’s  center  to  the  sun.  The  origin  is  at  this  center,  and  the  polar  angle  6 is  very  closely 
equal  to  the  zenith  angle.  The  medium  has  axial  symmetry  in  0.  At  fiequencies  well  above  the  maximum  plasma 
frequency  the  index  of  refraction  is  given  approximately  by 

Here  e = 2nelnmlmui  , nm  is  the  maximum  density  at  6 = w/2  (the  terminator),  e and  m are  the  electronic 
charge  and  mass, 


P(R,0)  =~exp- 


R ~Rm  _ 2(1  + <7  cos  0) 
1 + q cos  0 


for  the  exponential  layer,  and 


for  the  parabolic  peak  layer,  where  R = (r-rp)/ria  the  altitude  in  scale  height  units,  Rm  is  the  altitude  of  the  level 
of  maximum  density  in  scale  height  units,  Ha  is  the  normalizing  scale  height  and  corresponds  to  the  scale  height 
at  6 = rr/2,  r is  the  radius,  rp  is  the  radius  of  the  planet,  dj  and  dj  are  parameters  controlling  the  density  level  of 
the  entire  ionosphere  and  such  that  (ds  + da)/2  = 1 , v is  a parameter  controlling  the  rate  of  change  of  density  with 
6,  and  q is  a parameter  controlling  the  distribution  of  temperature  with  6.  The  subsolar  profile  is  for  0 = 0,  and 
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the  antisoLr  profile  is  for  6 = it.  For  large  values  of  v the  ratio  of  subsolar  to  antisolar  densities  at  the  peaks  is 
equal  to  dj/d^. 

For  the  purpose  of  numerical  calculations  we  chose  the  ratio  of  subsolar  to  antisolar  electron  densities  as  20: 1 . 
The  value  of  q us;d  is  0.5,  which  gives  a scale  height  ratio  of  subsolar  to  antisolar  profiles  of  3:1.  The  value  of  v 
was  set  equal  to  10  for  three  occultations.  Ninety  percent  of  the  change  in  electron  density  occurs  within  a band  of 
25°  centered  at  the  terminator  ( 6 = ir/2)  for  this  value  of  v.  One  occultation  was  calculated  fur  v = 100  for  which 
90%  of  the  change  occurs  in  a 3°  band.  The  ratio  of  the  radius  at  the  peak  to  the  scale  height  was  set  equal  to  105  : 1 . 
The  corresponding  radius  of  the  planet  is  100.  The  values  of  e for  the  two  frequencies  were  5X  10-5  and 
1.125X10“*. 


Figure  4.-  Occultation  orbits.  The  three  figures  show  the  elliptical 
projection  of  a circular  orbit  into  the  plane  perpendicular  to  the 
viewing  direction.  The  heavy  line  indicates  the  part  of  the  orbit 
used  in  the  experiment  (it  is  also  an  approximation  to  the  locus 
a;  nearest  points).  The  shaded  portions  indicate  the  night-side 
hemisphere  of  the  planet. 


Figure  5.  - Gradients  of  N for  occultation  indicated  in  figure  4a. 
The  absolute  values  of  the  gradients  of  N along  the  locus  of 
nearest  points  vs.  normalized  altitude  are  shown.  The  radial 
gradient  above  the  occultation  point  is  also  indicated. 


If  the  normalizing  scale  height  is  30  km,  the  radius  to 
the  peak  is  3150  km;  then  the  model  chosen  could  be  of  the 
scale  of  the  Martian  ionosphere.  If  the  plasma  frequency  at 
the  peak  level  at  6 - jr/2  is  3.5  MHz,  the  frequencies  associated 
with  the  chosen  values  of  e would  be  500  and  750  MHz.  For 
the  values  of  e the  separation  of  the  rays  at  their  nearest  points 
for  the  two  frequencies  neve'  exceeds  0.1  scale  height. 

The  spacecraft  is  assumed  to  be  in  a circular  orbit 
of  radius  120  Ha  and  inclined  to  the  direction  of  the 
earth,  so  that  its  elliptical  projection  has  a semiminor 
radius  of  1 00  Ha . The  axes  of  the  ellipse  are  either  at  45'  to 
the  axis  of  symmetry  or  to  its  projection  as  viewed  from 
earth  for  all  examples. 

The  first  computation  is  for  v = 10  and  for  an 
occultation  starting  on  the  night  side.  It  is  viewed 
perpendicular  to  the  axis  of  symmetry  as  shown  in 
figure  4(a).  The  absolute  values  of  the  radial  and  trans- 
verse angular  components  of  the  refractivity  gradient 
along  the  locus  of  nearest  points  are  shown  in  figure  5. 
both  for  exact  values  available  from  the  model  and 
computed  values  deduced  by  inversion  procedures  using 
computed  Doppler  shift  da  1 The  angular  component 
is  the  horizontal  componer  verse  to  the  rays. 

Radial  gradients  are  shown  ior  inversion  with 
and  without  corrections  for  the  angular  component  of 
the  gradient.  The  actual  radial  gradient  for  the  profile 
over  the  occultation  is  also  plotted.  It  is  seen  that  if 
inversion  is  performed  neglecting  angular  variations 
and  if  it  is  assumed  to  yield  the  profile  over  the  occulta- 
tion point,  the  interpretation  would  be  considerably  in 
error.  The  radial  gradient  errors  are  much  smaller  when 
compared  with  the  actual  profile  but  still  are  as  large  as 
50%  in  the  exponential  region  and  greater  in  the  para- 
bolic region.  Inclusion  of  the  angular  variation  reduces 
the  error  in  the  radial  component  by  more  than  a factor 
of  2.  The  computed  and  actual  angular  components 
compare  very  well  in  the  exponential  region,  but  the 
comparison  becomes  much  poorer  below  the  peak. 
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If  the  planet  is  viewed  with  the  axis  of  symmetry 
at  an  angle  of  45°  to  the  plane  of  projection  as  viewed 
from  earth,  with  the  occultation  as  in  figure  4(b),  one 
obtains  the  graph  shown  in  figure  6.  The  results  are 
essentially  the  same  as  before. 

Figure  4(c)  indicates  an  occultation  on  the  day 
side  through  a region  where  the  horizontal,  or  angular, 
components  of  the  gradient  are  small.  In  this  case,  all 
the  computed  radial  components  of  the  gradient,  the 
profile  along  the  locus  of  nearest  points,  and  the  profile 
over  the  occultation  point  compare  very  well  (fig.  7). 
The  comparison  of  horizontal  gradients  is  poor,  but 
the  results  do  indicate  that  it  is  smaller  than  the 
radial  component  by  a factor  of  100. 


Figure  6.-  Gradients  of  N for  occultation  indicated  in  figure 
4b.  The  absolute  values  of  the  giadients  of  JV  along  the 
locus  cf  neaiest  points  vs.  normalized  altitude  are  shown. 
The  radial  gradient  above  the  occultation  point  is  also 
indicated. 


With  v - 100  and  the  configuration  in  figure 
4(a)  one  obtains  the  results  shown  in  figure  8.  The 
rays  pass  through  a region  of  large  variation  of  re- 
fractivity  at  r///fl  = 108.5.  Two  distinct  peaks  are 
obtained  in  the  radial  component  when  the  angular  variation  is  neglected.  Inversion  incorporating  angular  effects 
considerably  improves  the  results.  Although  the  horizontal  gradient  from  inversion  is  poor,  the  large  peak  is  indicated 
The  results  exhibit  the  difficulties  when  very  large  angular  gradients  are  involved.  Refinements  in  numerical  pro- 
cedures may  improve  the  results.  The  increments  used  in  inversion  may  be  too  coarse. 
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Figure  7.-  Gradients  of  N for  occultation  indicated  in 
figure  4c.  The  absolute  values  of  the  gradients  of  N 
along  the  locus  of  nearest  points  vs.  normalized 
altitude  are  shown.  The  radial  gradient  above  the 
occultation  point  is  also  indicated.  The  actual  hori- 
zontal gradients  are  very  small  and  only  a small  por- 
tion is  indicated. 


Figure  8.-  Gradients  of  N for  occultation  indicated  in  figure 
4a  and  with  large  angular  gradients.  The  absolute  values 
of  gradients  of  N along  the  locus  of  nearest  points  vs. 
normalized  altitude  are  shown.  The  radial  gradient  above 
the  occultation  point  is  also  indicated. 
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DISCUSSION  AND  CONCLUSIONS 


The  feasibility  of  determining  both  radial  components  and  angular  components  of  the  refractivity  trans- 
verse to  a set  of  rays  in  an  occultation-type  of  experiment  lias  been  demonstrated.  The  angular  components 
are  found  by  using  frequency  dispersion  for  an  ionosphere  and  spatial  dispersion  for  a neutral  atmosphere. 

The  analysis  is  for  slightly  refractive  media  of  exponentially  limited  radial  extent  in  which  the  angular  compon- 
ent of  the  gradient  is  not  strong,  so  that  a Taylor  series  in  angular  coordinates  may  be  cut  off  at  the  first 
derivatives.  With  the  exception  of  deep  atmospheres,  it  is  likely  that  most  regions  about  a planet  will  be  in 
accordance  with  these  conditions.  Deep  atmospheres  will  have  large  radial  gradients  due  to  their  large  densities, 
and  the  residual  Doppler  may  be  mostly  from  the  radial  term.  The  angular  contribution  may  be  masked  and 
more  difficult  to  deduce.  The  physics  of  deep  atmospheres,  however,  may  limit  the  extent  of  the  angular 
variations. 

The  practical  feasibility  of  utilizing  dispersion  is  more  limited.  Accurate  Doppler  measurements  at  more 
than  one  frequency  would  be  required  for  ionization,  and  dual  or  multiple  payloads  would  be  required  for 
neutral  media.  The  position  of  the  spacecraft  about  the  planet  must  be  accurately  known,  and  the  effects  of 
inaccuracies  on  the  determination  of  angular  dependence  cf  the  medium  requires  further  study. 

The  occultation  experiment  provides  information  on  refractivity  of  th*  medium  and  does  not  necessarily 
determine  the  model  of  the  atmosphere.  It  could  provide  scale  height  information  from  which  temperature 
could  be  determined  for  the  assumed  mass.  However,  if  the  temperature  varies  with  altitude,  only  tne  density 
scale  height  is  determined,  which  is  not  simply  related  to  the  temperature.  The  horizontal  gradients  could  pro- 
vide information  useful  to  studies  of  possible  horizontal  motions  of  the  medium.  If  combined  with  temperature 
structure  information,  even  with  the  recognized  uncertainty  of  composition  and  scale  height  interpretation,  the 
measurement  of  horizontal  g'adient  could  serve  as  a means  for  understar  ding  the  processes  controlling  the 
distribution  with  latitude  and  longitude.  Since  the  physical  theory  of  atmospheres  and  ionospheres  usually 
separates  vertical  and  horizontal  control  of  the  media,  it  is  difficult  .0  interpret  data  along  a locus  of  nearest 
points  that  is  at  an  angle  to  cither  direction  without  separation  of  the  components  of  the  gradient  of  the 
refractivity.  The  present  form  of  the  occultation  experiment  only  approximates  the  total  derivative  along  the 
locus  which,  in  general,  is  different  from  the  radial  Scale  heights  derived  from  it  could  be  misleading  or  useless 
to  interpret,  since  it  would  correspond  to  the  variation  along  the  locus.  On  the  other  hand,  the  partial  deriva  ives 
with  respect  to  radius  and  angle  along  the  locus  would  at  least  separate  the  radial  portion.  One  can,  at  least, 
ittempt  to  approximate  a profile  along  radials  to  come  closer  to  the  density  scale  heights  for  the  range  of  angular 
coverage  of  the  occultation.  Measurement  of  higher  angular  derivatives  would  provide  better  accuracy  in  restruc- 
turing the  radial  profiles;  however,  the  present  practicability  appears  limited. 

Radial  components  of  the  gradient  of  refractivity,  as  exhibited  in  figures  S to  8,  may  be  interpreted  in 
terms  of  the  density  scale  height  along  the  locus  of  nearest  points,  since  the  data  may  be  graphically  analyzed 
to  fit  a simple  exponential  model  within  a region  of  altitudes.  The  graphs  of  the  radial  components  in  figures 
5 and  6 appear,  at  first  sight,  to  be  roughly  parallel  but  displaced  from  each  other.  Such  parallelism  implies 
equai  scale  heights  along  the  locus  with  the  separation  due  principally  to  the  magnitude  of  the  refractivity. 

Closer  examination  shows  that  the  lnes  are  not  quite  parallel,  implying  differences  in  their  scale  heights.  When 
the  radial  gradient  as  represented  by  the  partial  derivatives  is  combined  with  the  refractivity  computed  along 
the  locus,  the  radial  scale  height  may  be  computed  as  a function  of  position  along  the  locus.  In  general,  this 
will  differ  substantially  from  that  of  the  variation  of  refractivity  along  the  locus  and  will  exhibit  the  functional 
dependence  on  angle.  The  results  for  the  case  of  figure  5 are  plotted  in  figure  9. 

We  have  ignored  the  effects  of  the  interplanetary  medium.  Gradients  may  be  present  in  this  region  that 
could  cause  difficulties  in  establishing  the  sensible  extent  of  the  medium.  The  practice  is  to  subtract  out  th  5 
contribution.  One  method  utilizes  two  frequencies  [Fjeldbo  and  Eshleman,  1969]  on  the  basis  that  the  con- 
tribution of  the  interplanetary  medium  will  be  the  same,  except  for  a frequency  scaling  factor,  for  both  fre- 
quencivs.  The  ray  paths  to  earth  will  be  slightly  different  for  both  frequencies,  and  if  gradients  are  large  enough. 
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the  rays  for  each  of  the  two  frequencies  could 
be  traveling  through  regions  of  different  densities. 
Much  more  information  is  needed  on  the  detailed 
structure  of  the  interplanetary  medium  to  judge 
this  aspect.  When  two  frequencies  are  utilized  to 
attempt  to  cancel  out  the  contribution  of  the 
interplanetary  plasma,  then,  within  the  context  of 
this  discussion,  a third  frequency  would  be  neces- 
sary to  measure  the  ionospheric  radial  and  angular 
gradients. 

The  use  of  a modified  occupation  experi- 
ment deserves  further  study.  Its  use  is  a compro- 
mise between  the  constraints  of  spacecraft  payload 
and  scientific  needs.  At  best,  it  cannot  compare 
well  with  direct  probing  methods  and  soundings, 
methods  that  require  far  more  spacecraft  weight 
and  power. 


Figure  9 - Relative  scale  heights  derived  from  the  gradients 
of  Figure  5.  The  actual  relative  scale  height  is  unity. 
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DISCUSSION 


Croft:  There  ;s  a history  here  to  a closely  allied  problem  in  the  Rus.ian  literature.  Gringauz  and  Alpert  had 

a luntiinj  commentary  from  19<  5 to  1968.  One  was  measuring  the  Doppler  shift  signal  from  passing 
satellite.  They  calculated  the  u.  elated  electron  content  <s  the  satellite’s  a'itude  cnanged.  He  inter- 
preted the  dat  in  terms  of  heigh  profiles.  The  other  author  said  you  couldn’t  do  this  because  there  are 
localized  irreg  .iaritics  having  horizontal  gradients  tl  at  would  show  up  as  layers  in  your  stratified  models. 

If  > .M  start  with  a single-valued  functio  i of  one  pi:  imeter,  phase  versus  time,  I don’t  see  how  you  create 
two  functions  of  h-  aht. 

Gnm;  Nr  ' u have  two  simultaneous  measurements.  If  you  can  measure  for  two  loci  at  two  frequencies 
you  have  iwo  sets  of  measurements.  If  the  locus  of  nearest  points  happen  to  be  tangent  to  a line  of 
constant  rcfract.vity  and  if  you  assume  spheried  symmetry,  the  derivative  with  respect  to  r will  be  zero 
at  these  points.  Perhaps,  you  then  interpret  this  as  a peak  or  a minimum,  either  in  the  ionosphere  or  neutral 
atmosphere.  This  could  he  ntireiv  wrong.  When  you  have  the  components  of  the  gradient  you  are  then 
able  to  separate  the  information. 

Grossi:  The  Russian  problem  mentioned  by  Croft  is  much  more  difficult  because  you  have  the  telemeter  on 

the  ground  and  a telemeter  on  the  satellite  so,  for  instance,  the  ba:  io  equation  used  in  occultation  where 
the  integral  of  the  Dopp!  r leads  you  to  a phase  difference  wit  i an  integruion  constant  equal  to  zero  is  no 
longer  applicable  m t ie  Gringeuz  and  Alpert  case.  Many  points  of  their  dispute  are  therefore  not  applicable 
here. 
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Inversion  of  radio  occultation  data  for  planetary  atmospheres  and  ionospheres  has  been  performed  by  the 
authors  using  the  seismological  Herglotz-Wiechert  method,  as  adapted  by  Phinney  and  Anderson  to  the  radio- 
occultation  case.  This  method  does  not  require  the  assumption  of  straightline  behavior  for  the  radiorays. 

Profile  reconstruction  performed  in  computer-simulated  experiments  with  this  approach  have  been  compared 
with  the  ones  obtained  with  the  straight-ray  Abel  transform. 

For  a thin  atmosphere  and  ionosphere,  like  the  ones  encountered  on  Mars,  microwave  occultation  data  can  be 
inverted  accurately  with  both  methods.  For  a dense  ionosphere  like  the  sun’s  corona,  ray  bending  of  microwaves  is 
severe,  and  recovered  refractivity  by  the  Herglotz-Wiechert  method  provides  significant  improvement  over  the  straight- 
rey  Abel  transform:  the  error  reduces  from  more  than  60%  to  less  than  20%  at  a height  of  60  X 103  km  above  the 
bare  of  the  corona. 


INTRODUCTION 


Radio  occultation  measurements  of  planetary  atmospheres  and  ionospheres  have  become  an  integral  part  of  the 
standard  scientific  investigations  performed  by  NASA  with  planetary  probes  since  the  mid-1960’s,  when  various 
spacecraft  of  *he  Mariner  class  performed  pioneering  radio  occultation  experiments.  These  experiments  provided 
the  first  reliable  measures  of  the  profiles  of  the  atmosphere  and  ionosphe.e  of  Mars  [Kliore  et  al,  1965  ; Fjeldbo  etal, 
1966;  Fjeldbo  ana  Eshleman,  1968]  andofVenus  [ Kliore  etal , 1967;  Mariner  Stanford  Group,  1967;  Fjeldboand 
Eshleman,  1969] 

The  data  are  obtained  in  the  form  of  “Doppler  shift  residuals”  (characterized  by  errors  with  lo  * 1 mm/sec), 
which  are  derived  by  subtracting  from  the  total  observed  Doppler  affecting  a 2200-MHz  microwave  carrier  the 
amount  due  to  the  link’s  geometric  changes.  The  presence  of  a nonzero  residual  (for  situations  in  which  both  link 
terminals  are  outside  the  medium  under  probing)  is  indicative  of  a change  in  the  takeoff  or  arrival  angle  of  the  ray 
thdi  propagated  (oneway  or  twoway)  between  ;arth  ?nd  spacecraft  and  traveled,  for  at  least  a portion  of  its  path, 
through  the  ionosphere  and  the  atmosphere  of  the  planet. 

The  inversion  of  the  data  has  been  performed  with  a variety  of  geometric  optics  techniques  ranging  from  the 
closed-form  Abel  transform  (based  on  the  assumptions  th».i  the  medium  possesses  spherical  symmetry  and  straight- 
line  behavior  for  the  ray)  to  model-fitting  approaches  based  on  iterative  procedures.  In  principle,  for  a planet  with 
a thin  atmosphere  like  Mars,  all  these  methods  are  adequate.  For  cases  like  Jupiter  or  the  sun’s  corona,  closed-form 
inversion  algorithms,  desirable  in  many  respects  above  model-fitting  approaches,  must  exclude  the  straight-line 
approximation. 
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One  suitable  method  is  available  from  seismology,  based  on  the  Herglotz-Wiechert  approach  for  interpreting 
seismic  data  [Herglotz,  1907;  Wiechert  and  Geiger,  1910].  Another  method  has  been  published  recently  by  Fjeldbo 
etal.  (1971). 

In  seismology,  this  bent  ray-path  technique  leads  to  the  derivation  of  velocity-depth  profiles,  and  to  the  calcu- 
lation of  the  position  and  the  identification  of  geological  features  in  the  earth's  mantel.  The  observables  are  the  travel 
times  of  seismic  waves  between  stations  located  on  the  earth  surface.  Phinney  and  Anderson  (1968)  have  shown  that 
this  method  '.s  usable  for  the  situations  in  which  the  observables  are  the  Doppler  residuals  of  the  radio  occuitation 
measurements.  The  purpose  of  our  work  has  been  to  code  their  formulas  in  a computer  program  and  to  use  it  in  the 
reduction  and  processing  of  actual  radio  occuitation  data. 

The  seismological  approach  has  been  applied  to  the  inversion  of  simulated  occuitation  data  for  the  sun’s  corona 
and  Mars.  Application  to  actual  experimental  data  is  under  way.  The  inversion  accuracies  have  been  compared  with 
the  ones  that  characterize  the  inversions  obtained  with  the  straight-ray  Abel  transform.  We  have  found  that  for  cases 
of  strong  ray  bending  (as  occurs  in  the  microwave  occuitation  probing  of  the  sun’s  corona),  the  errors  with  the  Herglotz- 
Wiechert  approach  are  significantly  smaller  than  with  the  straight-ray  Abel  transform.  However,  for  thin  atmospheres 
they  are  comparable,  as  expected. 

A computer  program  based  on  the  Phinney  and  Anderson  analysis  has  been  developed  in  preparation  for  the 
reduction  of  the  occuitation  data  to  be  gathered  in  1976  by  the  Viking  spacecraft  for  Mars  and  the  Sun,  and  by  such 
missions  as  the  Grand  Tour  Mission  to  the  outer  planets.  Meanwhile,  the  program  is  being  used  as  a test  to  reduce 
cases  of  past  occuitation  measurements  like  the  one  performed  by  Mariner  4 to  Mars. 

Comparative  sensitivity  to  horizontal  gradients  and 
applicability  of  the  seismological  approach  developed  by 
Gerver  and  Markushevich  [1966,  1967]  to  situations  of 
waveguidance,  as  encountered  for  instance  in  radio  occulta- 
tions  occurring  at  the  base  of  the  Venus  atmosphere,  are 
under  investigation.  The  applicability  of  the  seismological 
approach  developed  by  Backus  and  Gilbert  [1967, 1968, 

1970]  is  also  under  study.  All  these  analyses  require  the 
computation  of  the  impact  parameter  of  the  radio  ray  from 
the  observed  Doppler  residuals,  a derivation  that  has  been 
proven  feasible  by  Phinney  and  Anderson  [1968] . 


PROFILE  INVERSION  BY 
STRAIGHT-RAY  ABEL  TRANSFORM 


The  integration  in  the  time  domain  of  the  Doppler 
residuals  provides  the  differential  ph  ise-path  length.  This 
quantity  is  the  difference  between  the  straight-line  geomet- 
rical distance  between  transmitter  and  receiver  and  the 
phase-path  length  for  radio  waves.  The  integration  constant 
is  zero  when  provisions  are  made  for  starting  the  integration 
from  a position  of  the  probing  link  completely  external  to 
the  medium  under  evaluation. 


With  reference  to  figure  1 and  assuming  a straight- 
line  ray  the  spherically  symmetric  medium  (index  oi 
refraction  n is  a function  of  only  of  r),  we  obtain  the 
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differential  phase-path  length 
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This  integral  is  a function  of  the  distance  of  the  ray  from  the  center  of  the  planet  at  the  point  of  its  closest 
approach  to  the  planet’s  surface.  We  call 
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By  changing  the  variable  ds.  we  can  write  (fig.  1): 
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In  this  equation  the  radicand  in  the  denominator  is  nonnegative  because  for  every  value  of  the  miss-distance 
range  p of  the  ray  from  the  center  of  the  planet  r > p.  Let 
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Applying  Abel’s  integral  equation  we  have, 
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Because  of  the  change  in  the  limits  of  integration,  the  radicanc*  in  the  denominator  is  nonnegative.  In  fact,  for 
every  radial  height  r,  N(r)  and  n(r)  are  obtained  from  columnar  measurements  made  at  miss-distance  p 
rlways  larger  than  r. 

The  straight -ray  Abel  transform  approach  described  above  has  been  applied  to  the  Martian  ionosphere/ 
atmosphe.e  and  the  sun's  corona,  as  representative  of  thin  and  thick  media. 

A Hamiltonian  ray  tracing  program  was  used  in  conjunction  with  the  chosen  models  to  derive  the  simulated 
differential  phase-path  lengths  for  various  values  of  p. 

Figure  2 illustrates  the  model  used  for  the  Mars  refractivity  profile  (MODEL)  and  the  Abel  transform 
reconstruction  of  this  refractivity  model  (ABEL).  Table  1 lists  the  difference  in  N units  between  the  model 
refractivity  profile  and  the  Abel  transform  reconstruction.  Figure  3 and  table  2 show  the  same  functions  for 
the  sun’s  corona. 

It  is  evident  that  the  Abel  transform  provides  excellent  results  for  the  case  of  the  “thin”  Martian 
ionosphere  and  atmosphere.  The  errors  b“come  intolerable,  however,  in  the  reconstruction  of  the  sun's  corona 
when  the  validity  of  the  straight-line  approximation  for  the  ray  breaks  down. 
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Figure  2.-  Model  cf  Man  refractivity  profile  and  its 
reconstruction  by  Abel  transform  (rectangles  traced 
around  the  pain  of  circles  and  squares  identify  the 
original  and  the  reconstructed  point). 


Figure  3.-  Model  of  the  Sun  corona  refractivity  profile  and 
its  reconstruction  by  Abe!  transform  (rectangle*  traced 
around  the  pain  of  circles  and  squares  identify  the 
original  and  the  reconstructed  |>oint). 
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TABLE  1.  ORIGINAL  AND  RECONSTRUCTED  MO  DLL  IOR 
ATMOSPHERE  AND  IOv  OSPHERE  OL  MARS 
(RECONSTRUCTION  B\  ABEL  TRANSLORM) 


I’rop.igation  Lrequency  = 2000  MHz 


Pbnetocentric  Radius 

Differential  1 nase 

Hctractivity 

Refractivity 

Delay 

(Original) 

( Reconstruct! 

3385  8038408 

.00049873 

2.6910007 

2 5343329 

3390  7541166 

-.00006561 

1.2969379 

1.1127441 

3395  6977021 

-00028884 

7342651 

.7000476 

3400  6116347 

-00043633 

.4823507 

.4507898 

3405  5248131 

-0005452"’ 

.2818609 

.2533°''! 

3410.4371543 

-.00062016 

.1327903 

,107"<  •» 

3415  3485406 

- 00066  717 

0351334 

.015„015 

3420.2587376 

- OOOf Q>22 

-0111152 

-0243288 

3425  1669661 

- 300 ’07 59 

-0059610 

-.0112774 

3430.0736711 

-00072737 

0 0000000 

-0093382 

3434.9810468 

- 03073000 

-.0000000 

-.0094812 

3439.8883434 

- 00077536 

-.0^00018 

- 0095012 

3444.7955183 

-.30080421 

-0000454 

- 0092206 

3449.7024877 

-.00083780 

-0005587 

-0090053 

3454.6091278 

-.00087818 

- 0038  784 

-0115760 

3459.5153911 

-00092821 

-0171971 

-0252309 

3464.4215252 

-.00099024 

-0536946 

-0647943 

3469  3281224 

-.00106342 

-.1274150 

- 1446981 

3474.2358337 

-00114185 

-.2439253 

-.2691008 

3479  1449747 

-.00121516 

- 3947539 

-4272813 

3484.0553671 

-.00’  ’5 

-.5601490 

-.5978040 

3488.9664814 

-.00  30720 

-.7171498 

-.7571699 

3493  8777050 

- 00131317 

-8471618 

-.8874073 

3498.7885483 

-.00129184 

-.9396791 

-.9789639 

3503.69872?i 

-.001 24699 

-.9922043 

-1 .0303 1 ^8 

3508.60815,  5 

-.00118405 

-1.0080616 

-1.0456119 

3513  516P  98 

-.00110876 

-9937569 

-1.0318672 

35 18.425 '’086 

-.00102635 

-.9568190 

-.9909566 

3523.3?  -6833 

-.00094117 

-.9044097 

-.9484911 

3528.2  00366 

-.00085648 

-.8426.  73 

- 893047. 

3533  . 171808 

-00077461 

-.7762285 

•.8368U53 

353f.  0542042 

-.00069709 

-.7087782 

-.7933753 

3'/  2.961 1723 

-.00062482 

-.6427287 

-638‘>328 

3:5  ' 7.8681 304 

-.00055822 

-.5796891 

-.5759056 

1:52.7751089 

-.00049739 

-5206237 

-.5169908 

3557.6821263 

-.00044222 

-.4660302 

-4626115 

3562.5891934 

-.00039245 

-.4160816 

- 4129212 

3567.4963147 

-.00034772 

-.3/07338 

-.3678395 

3572.4034909 

-.00030765 

-.3298046 

—.3271  SC"7 

3577.3107209 

-00027183 

-.2930299 

-.2906-oj 

3582  2180C17 

-.00023988 

-.2601030 

-.2580032 

3587.1253293 

-.00021141 

-.2307005 

-.2287441 

3592  0326997 

-.00018607 

-.2045000 

-.2028152 

3596.9401086 

-.00016352 

-.181 1*08 

-.1796952 

3601.8475524 

-.00014346 

-.160*799 

- 1591 560 

3606.7550269 

-.00012562 

-.1420959 

- 1409339 

3611.6625288 

-.00010975 

-.1257898 

- 1247573 

3616.5700548 

-.00009563 

-.1  11335.' 

-.1104472 

3621.4776021 

-.00008303 

-.0985  2?  ■ 

-.0977606 

3626.3851679 

-.00007179 

-.09  71 8.-6 

-.0865069 

3631.2977502 

-.00006175 

- 0771 4 20 

-.0765859 

3636.2003466 

-.00005273 

-.0682510 

-.0678084 

3641  ,079552 

-.00:  4459 

-.06', ’3816 

-.0600339 

1IIL 


Refractivity  Differcnc 


-1. 56667831-01 
- 1 . 84 1 9 3 7 0 K —0 1 
-3  421  752 1 L— 02 
-3.1560943E-02 
-2  84621271  -02 
-2  4S14>  16L-02 
-2.01318641  -O' 
-1  3213628!  -r  2 
-5  3163954L-  -,)3 
-9  33819: M -03 
—9.4  81  1 74 1 i 03 
-9  49940161-03 
-9.17524221  -03 
-8  44668831-03 
-7.69“5947l  -03 
-8  O'1 3 7 571  -03 
-1  ’0996841.-02 
-1  7283072L-02 
- 2.51 75S55L— 02 
-3  2527304E-02 
-3  76549901  02 
-4.00200881  -02 
-4.024  5454E-02 
-3 9.84811 E-02 
—3.8 1 085 1 2L  — 02 
-3.755031 2E-02 
—3.81 1 _ 343L-02 
-4.0137608E-02 
-4  3081403E-02 
-5.0430333E-02 
-6.058175’E-02 
-8.4597098E-02 
3.7959433E-03 
3.7835888E-03 
3.6329555E-03 
3.4187144E-03 
3.1604393E-r3 
2.8943039E-03 
2.6238853t-03 
2.3866  255L-03 
2 09976°3h-03 
1.9063353E-03 
1.68481 20L-03 
1.495607 .1  -03 
i 1 139359E-03 
: i ,‘':67E-03 
1 0.-2481 3E-03 
'•  OS’  'S48E-0' 
I 6d05197E-o; 
6.78",5790E-  34 
5.56161851  -04 
4.4254360  04 

3.4  76466  oE-U' 


TABLE  1.  ORIGINAL  AND  RECONSTRUCTED  MODEL  FOR  THF 
ATMOSPHERE  AND  IONOSPHERE  OF  MARS 
(RECONSTRUCTIC  N BY  ABEL  TRANSFORM) 
(Continued) 

Propagation  Frequency  = 2000  MHz 


Planetocentnc  Radius 

Differential  Phase 
Delay 

Refractivity 

(Original) 

Refractivity 

(Reconstructed) 

Refractivity  Difference 

3646.0155746 

-.00003720 

-.0534  1 74 

-.0531748 

2 4255063E-04 

3650.9232031 

-.00003042 

-.0472549 

-.0472093 

4.5665114E-05 

3655.8308389 

-.00002406 

-0418024 

-0419971 

-1.9476145E-04 

3660.7384804 

-.00001788 

-.0369783 

-.0378509 

-8.7267045E-04 

3665.6461239 

-.00001127 

-.0327104 

-.0378314 

-5  1210458E-0? 

3670.5537461 

-.00000000 

0 0003000 

.0000006 

5.7527000E-07 

36  75.4614384 

-.00000000 

0 0000000 

0000007 

6.957485  2E-07 

3680  3691308 

-.00000000 

0 0000000 

.0000008 

8.0/21452E-07 

3685  2768231 

-.00000000 

0 0000000 

0000002 

1.7881572E-07 

3690.1845154 

-00000000 

0 0000000 

.0000012 

1.1589587E-06 

3695  0922077 

-00000000 

0 0000000 

00000000 

0. 

^ABLE  2 - ORIGINAL  AND  RECONSTRUCT  MODEL  FOR  THE  SUN  CORONA 
(RECONSTRUCTION  BY  ABEL  TRANSFORM) 


Propagatio'.  Frequency  = 2000  MHz 


’le'io.entric  Radii. 


7.5786310E+05 
8.4348448E+05 
9.7715388E+05 
1.1 150906  L-h.^ 
1.2538732E+06 
1.39293f\)E-K>6 
].5320%3c+0f 
1.F7I3086E+06 
1. 810554  2E+C6 
1.9498074E+O6 
2.08906 14E+06 
2.2283313E+06 
2.3675974E+Of 
2 5068577E+06 
2S461333E+06 


Differential  Phase  Delay 


-1.4 702246315+03 
-5.845 244 54E+02 
-2.4S342834E  K)2 
-1.2  2995902E+02 
-6.78245537E+01 
-3.9931 3 143E+01 
-2  47259222E+C1 
-1.59462278E+01 
-1.06331003E+01 
-7.28939208E+00 
-S.11282164E+00 
-3.65362583E-HX) 
-2.64969763F+00 
-1.94251251F+00 
-1.43321624E+00 


Refractivity  (Original) 

-4.3',316152E+03 
-6.82627600E+02 
-2.210601 35E+02 
-9.4  /;53676E+01 
-4.61757491E+01 
-2.44567678E+01 
-1.379007!9E+0i 
-8.17848559E+00 
-5.05856642E+OC 
-3  24259447E+00 
-2.14S43004E+00 
-1.45520568E+00 
-1.01 145934E+00 
—7.1 7816349F— 01 
- 5.1 8941981 E— 01 


Refractivity 

(Reconstructed) 

-1.8M35583E+03 
-5.54988338E-02 
-1.98717232E+02 
-9.22099868E+01 
-6.52852467E+01 
-2. 27  4605  88E+0 1 
-1.29625256E+01 
-7.746  20536E+00 
-4.82507421  E+OO 
-3.1 15484  79E+00 
-2.0771 2544E+O0 
-1.42693223E+00 
-1.010%1 16E+00 
-7.449765  73E-01 
-6.25953850E-01 


Reluctivity 

Difference 

2.47880569E+03 
1.2763926  2E+02 
2.23429028E+01 
2.50538070E->00 
-1.91094976E+01 
1.67070897E+00 
3.2754633  ZE— 01 
4.3228023 1E-0! 
2.33492208E-01 
1.2,10%77E-01 
6.63045950E-02 
2.82734498E-02 
4.98183008E-04 
-2.71602249E-02 
-1.0701 1869E-01 
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THE  FEASIBILITY  OF  DERIVING  THE  IMPACT  PARAMETER 
FROM  THE  DOITLER  RESIDUALS 


As  an  immediate  consequence  of  Snell’s  law.  in  spherical  coordinates,  microwave  radio  rays  passing  through 
a refracting  medium  are  characterized  by  the  fact  that  the  product  of  the  index  of  refraction  n.  the  radius  r.  and 
the  sine  of  the  zenith  angle  z is  a cons  tan* 

In  geometric  optics  treatment  of  microwave  propagation,  this  constant  is  of  fundamental  importance.  It  is 
r.ot  only  the  product  n<yr0  at  the  point  of  tangency  and  r(sin  z)  at  any  point  beyond  the  atmosphere,  but  it  is 
also  the  radial  perpendicular  distance  to  the  original  undisturbed  ray  path,  or  asymptote.  It  is  therefore  the 
necessary  parameter  to  trace  a ray  tniough  a g 'en  atmosphere,  and  it  is  a vital  quantity  in  inversion  problems. 

In  paiticle  physics,  the  perpendicular  distance  from  the  center  of  a force  field  to  the  velocity  vector  of  an 
entering  charged  particle  is  called  the  impact  parameter.  The  analogy  with  the  radio  rays  is  crude  since  the  atmospheric 
refractivity  cannot  be  considered  truly  equivalent  tc  a force  field  and  our  ravs  are  not  particles  in  many  respects; 
nevertheless,  “impact  parameter”  seems  to  be  an  appiopriate  name  for  the  rays’  fundamental  constant.  The  exact 
inversion  technique  for  phase  data  based  on  the  use  of  this  parameter  was  presented  by  Phinney  and  Anderson  [1968] . 
Also,  Tatarsky  [1968a.  b]  utilized  this  term  as  the  radius  at  the  tangent  point  r0. 

The  feasibility  of  deriving  ihis  parameter  from  the  Doppler  residuals  has  been  debated  in  the  literature.  The 
analytical  relationships  between  impact  parameter  and  Doppler  residuals  derived  by  Phinney  and  Anderson  were 
questioned  (incorrectly,  in  our  opinion)  by  Graves  and  Fischbach  [ 1969] . who  argued  against  Phinney  and 
Anderson’s  equations  for  the  eiconal  and  the  directional  derivatives  in  terms  of  Doppler  shifts.  These  equations 
are  fundamental  and  give  the  ray  direction  with  respect  to  the  spacecraft  trajectory  as  a function  of  the  Doppler 
residuals.  Fischbach  [19701  later  reduced  the  weight  of  his  objectives. 

To  evaluate  the  radio  ray  emission  angle  from  the  directional  derivative  it  is  necessary  that  the  gradient  of  the 
phase  or  the  eiconal  equation  be  determined.  We  assume  an  infinite  nonconducting  isotropic  medium  where  the 
inductive  capacity  e is  a function  of  position.  Then  Maxwell’s  equations  for  electric  field  vector  become: 

v5E  + A2E  = = vJE  + A2n-E  (11) 

where  Kg  = fic  is  the  propagation  constant  m;.-sured  in  cycles.  If  we  assume  that  the  spatial  change  of  e is 
small  compare^  a wavelength 


<<  — 

e X 


then  the  wave  equation  reduces  to  its  homogeneous  form 


(12) 


v2E  + Aj«*E  = 0 (’3) 

Assume  that  the  form  of  the  solution  is; 

E - A{x,y,z)e^tjS(x,y,z)  (]4) 


340 


where  S = Sr  + iSj  so  that 

Sr  = constant,  are  surfaces  of  constant  phase 
5(-  = constant,  are  surfaces  of  constant  amplitude 

Substitution  of  Eq.  (14)  into  Eq.  (13)  yields: 

K2qA  [ n2  -M')J]  - iK0  [/f^S  + 2(vA)(vS)j  - 0 (15) 


If  K0  is  large  in  the  sense  that 


and 


vM 


«K20 


Av2S  + 2(vA)(vS)  «K0 


(16) 


(17) 


then,  excluding  regions  of  diffraction,  focal  points, 
caustics,  and  sources,  we  obtain 


(vS)1  = n2 
Now  KqS  = <t>  + ft  so  that 


which  is  the  eiconal  equation. 


(18) 


(19) 


Since  n is  a function  of  position,  if  the  space- 
craft S in  figure  4 is  inside  the  planet  ry  atmosphere, 
the  profile  of  the  atmosphere  must  be  known  before 
Eq  (19)  can  be  solved  for  |v0|. 

If,  however,  the  spacecraft  is  above  the  atmo- 
sphere, then  n becomes  unity  and  Eq.  (19)  becomes 


(v0)J  =^— j (20) 

The  last  equation  is  identical  to  the  eiconnl  of 
Phinney  and  Anderson  [ 1 968] . Equation  (20)  is  now 
in  a solved  form  and  ca-  be  substituted  into  the 
equation  for  the  directional  derivative  to  solve  for  the 
emission  angle. 


Figure  4.-  Geometry  of  the  occultation  experiment. 
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The  angle  between  the  ray  path  and  the  unperturbed  ray  path  is  the  deviation  angle,  and  according  to  our 
calculation  it  is 


1 

tan  — - <pe) 

*7 


[$  ~ (Pe  ♦ Jatm)2]12  - fa* -/g)'* 

®atm  + 2^e 


(21) 


where 

De  = the  Doppler  shift  of  the  unperturbed  ray 
Datm=  the  Doppler  residual  due  to  the  atmosphere 
Ds  = f/cs. 


PROFILE  INVERSION  WITH  BENT  RAYS  BY  THE 
SEISMOLOGICAL  APPROACH 


The  straight-ray  approximation  can  be  dropped  when  referring  to  inversion  approaches  like  the  Her^lotz- 
Wiechert  method.  In  seismology,  the  observable  is  the  central  angle  6 subtended  by  seismic  rays  in  the  earth 
(fig.  4);  and  the  analytical  steps  are  arranged  in  such  a way  that  the  velocity-depth  profile  is  obtained  by  operating 
on  8.  In  radio  occupation  measurements,  the  data  are  obtained  in  the  form  of  Doppler  shift  residuals.  Phinney 
and  Anderson  [1968]  have  shown  how  he  Heiglotz-Wiechert  approach  can  be  applied  to  these  Doppler  residuals, 
leading  to  the  accurate  reconstruction  of  the  refractive  index  profile,  in  a condition  of  ray  bending,  strong  as  it 
may  be. 

The  geometry  of  the  occupation  is  shown  in  figure  4 The  index  of  refraction  n is  a radially  dependent 
function  and  from  Fermai  s principle  the  first  variation  of  the  phase-path  length  should  be  zero. 

5/nds  = 0 (22) 

where  ds1  = dr3  + r2  d02 . The  Euler-Lagrange  equation  that  yields  a minimum  for  Lq.  (22)  is 


nr 2 (dfl/dr) 

, "Tt  i - constant 
1 + r2  (id'dr)2  J 


(23) 


and  from  the  boundary  conditions  at  the  minimum  'adius.  Snell’s  law  ir.  spherical  geometry  is  determined.  That  is 

p - nr  sm  / = np  ip  = impact  para,  eier  (24) 

and  the  impact  parameter  is  a constant  for  » given  ray. 

Now  let  us  define  the  variable  tj  as 


sin  i 
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From  the  definition  of  the  path  length 


and  the  application  of  Eqs.  (23)  through  (25),  the  integral  function  for  the  path  length  is 


S = 


nr(t)2  -p2)  12 


dr 


(26) 


(27) 


The  phase  (angular  measure)  along  the  ray  path  is 


0 = 


(28) 


and  using  Eqs.  (25)  and  (27) 


we  then  apply  the  operator 


(29) 


(30) 


where  > r)i  > rip,  to  equation  (29).  Then,  by  interchanging  the  order  of  integration,  integrating  by  parts 
repeatedly,  and  performing  a final  integration  over  the  appropriate  regions,  the  phase  function  (29)  can  be  shown 
to  be  inverted  to 


Kr>)  = r0  exp 


c 


rv 

» 

P 

i d0 

I cosh 

p(v) 

~ ~ -dp 
P dP 

'n0 

(31) 


Thus  the  refractivity  is  found  as 


m = 


pin) 


o exp  - jc/jr/  fV  cosh-1  \p/p(r>)\  1 Ip  d0/dp  dpi 


- 1 


(106)  (32) 
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Following  Phinnev  and  Anderson  [1968] , if  we  define  a ray  residual  <t>a 


where  <t*a  is  a geometrical  term,  then 


-V(x)  = 


“pii/ 

J*o 


cosh  1 


’p(x)' 

1 d<J>c 

p(xl) 

p(x)  dx 

(106) 


(33) 


(34) 


where  x is  a dummy  variable  such  as  time  to  occultation  or  satellite  position. 


In  our  computer  simulated  experiment,  the  impact 
parameter  p is  calculated  directly  from  the  assumed 
trajectory  and  from  the  Hamiltonian  ray  tracing  program. 
Therefore,  a suitable  impact  parameter  history  is  known 
and  two  forms  for  the  recovered  refractivity  and  minimum 
probing  radius  were  calculated:  £q.  (32),  since  the  data 
are  available  from  the  generating  program  and  it  will  conform 
Eq.  (31);  and  Eq.  (34),  where  is  found  in  terms  of  the 
actual  observed  phase  shift. 

Data  have  been  generated  for  a Martian  atmosphere 
model  similar  to  the  one  developed  from  the  Mariner  4 
results.  For  this  example,  ray  bending  is  small  (approxi- 
mately 40  /arad)  and  the  straight-ray  Abel  transform  has 
been  shown  in  the  past  to  be  a fully  adequate  approximation. 
The  recovered  refractivity  from  the  straight  ray  Abel  transform 
(ABEL)  and  seismic  inversion  (SEISMIC),  and  the  model 
refractivity  (MODEL)  versus  geocentric  radius  of  Mars  are 
plotted  in  figure  5 for  the  measured  phase  function. 

Table  3 shows  that  for  the  thin  Martian  atmosphere, 
the  errors  in  recovered  refractivity  with  ABEL  and  SEISMIC 
are  roughly  the  same.  Also,  the  minimum  probing  radius 
found  by  tb'  straight-ray  Abel  transforms  is  very  close  to 
the  actual  radius,  due  to  the  small  amount  of  ray  bending. 
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Figure  5.~  Model  of  Mars  refractivity  profile  and  its  recon- 
struction by  abel  transform  and  Herglotz-Wiechert 
inversion  (rectangles  traced  around  the  pairs  of  circles, 
squares,  and  crosses  identify  the  original  and  the 
reconstructed  point). 
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The  Herglotz-Wiechert  approach  has  been  applied  to 
the  model  of  the  lefractivity  of  the  sun’s  corona.  In  this 
example,  ray  bending  becomes  significant,  and  the  deviations 
between  the  classical  straight-ray  Abel  transform  and  the 
seismic  approach  are  clearly  noticeable. 

A Baumbach  electron  density  model  for  the  sun’s 
corona  was  assumed  and  phase  data  for  a frequency  of 
2,000  MHz  were  generated.  The  recovered  refractivity 
profiles  from  ABEL  and  SEISMIC  are  plotted  along  with 
the  original  model  (MODEL)  versus  geocentric  radius  in 
figure  6 for  the  measured  phase  function. 
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i iguve  6.-  Model  of  sun  corona  refractivity  profile  and  its 
reconstruction  by  abel  transform  and  Herglotz-Wiechert 
inversion  'rectangles  traced  around  the  pairs  of  circles, 
squares,  and  crosses  identify  the  original  and  the  recon- 
structed point). 
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TABLE  4. 


As  shown  in  table  4,  the  divergence  of  the  recovered  refractivity  and  minimum  probing  radius  compared  to  the 
original  model  values  increases  as  the  bending  becomes  appreciable.  For  the  last  value  calculated,  the  straight-ray 
Abel  transform  was  in  error  2.47X103N  units  in  the  calculated  refractivity  and  5. 84X 103  km  in  the  minimum 
prooing  radius,  compared  to  the  seismic  error  of  0.92X  103N  units  in  refractivity  and  0.65X  1 03 km  in  minimum 
probing  radius. 


EFFECT  OF  HORIZONTAL  GRADIENTS 


In  the  presence  of  horizontal  gradients,  the  Snell’s  law  equation  for  the  impact  parameter,  which  is  used  to 
recover  the  refractivity,  is  no  longer  valid.  Therefore,  as  the  severity  of  the  horizontal  gradient  becomes  larger, 
ihe  Herglotz-Wiechert  inversion  technique  gives  poorer  reconstruction  o;  the  refractivity  profile.  It  is  thought 
that  b\  tarting  again  from  Fermat’s  principle  for  the  new  geometry,  a better  equation  for  conservation  of  some 
parameter  for  this  type  of  ray  may  be  found.  This  conservation  properly  applied  to  the  Herglotz-Wiechert  approach 
may  yield  better  reconstruction  of  the  refractivity  profile  for  horizontal  gradients. 


CONCLUSIONS  AND  FUTURE  RESEARCH 


The  work  reported  in  this  paper  has  shown  that  profile  inversion  of  refractivity  data  collected  with  sNongly 
bent  rays  requires  the  use  of  techniques  like  the  seismologica!  Herglotz-Wiechert  approach  that  do  not  rely  on  the 
validity  of  the  straight-line  approximation.  Inversion  errors  associated  with  the  application  of  the  straight-ray 
Abel  transform  to  a medium  like  the  sun’s  corona  have  been  shown  to  be  intolerably  high  (larger  than  607)- 

What  still  remains  are  the  application  of  the  method  to  simulated  and  actual  data  characterized  by  instrumen- 
tation phase  errors  and  noise;  the  evaluation  of  the  ability  of  the  algorithm  to  extract  localized  averages  of  the 
occultation  profile  from  columnar  measurements;  the  evaluation  of  the  profile  uniqueness  and  the  related  calcu- 
lation of  the  probability  associated  with  each  one  of  the  “possible”  profiles  obtained  from  the  data  by  inversion; 
the  treatment  of  cases  involving  waveguidance;  and  the  evaluation  of  the  minimum  sampling  rate  required  to 
assuie  a desired  accuracy. 

All  of  these  extensions  planned  for  our  research  are  not  expected  to  invalidate  i.i  any  respect  our  present 
conclusions  indicated  above.  Instead,  they  will  upgrade  the  capability  of  the  bent-ray  approach  for  the  inversion 
of  planetary  atmospheres  and  ionspheres  toward  the  level  of  sophistication  reached  through  decades  of  practice 
by  the  seismologists  in  probing  the  earth’s  mantel  with  seismic  waves. 
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DISCUSSION 


Kliore:  Your  application  of  what  you  call  the  Abel  transform  approach  labels  that  transform  unfairly  as  something 

usable  only  in  the  thin  atmosphere  approximation.  Actually  you  used  the  Abel  transform  in  both  cases.  You 
operate  on  different  functions,  one  representing  a thin  atmosphere  approximation  and  the  other  the  general 
case.  What  you  call  the  seismological  inversion  is  simply  the  general  case  in  which  you  consider  everything 
including  the  ray  bending,  etc  It  is  not  clear  to  me  how  you  obtain  the  Herglotz-Weichert  method.  How 
is  that  related  to  the  observed  phase,  which  is  simply  the  integral  of  the  Doppler? 

Wallio:  4>(a)  is  the  observed  Doppler  residual  minus  a geometric  term  that  is  a straight  line. 

Kliore:  That  is,  <t>  is  simply  the  contribution  of  a phase  path  through  the  atmosphere? 

Wallio:  is  actual  Dopple-  along  the  ray  path  through  the  atmosphere,  like  Phinney  and  Am  arson,  but  you  don’t 

measure  that. 

Kliore:  You  seem  to  imply  it  was  not  generally  known  to  use  the  thin  atmosphere  approximation;  what  you  call 

the  Abel  transform  method  would  not  give  the  correct  result  in  thick  atmospheres. 

Wallio:  1 did  not  mean  to  imply  that. 

Gross:  The  only  limitation  on  the  Abel  translorm,  assuming  spherical  symmetry,  is  the  monotonic  variation  of  the 

product  of  the  index  of  refraction  fi  and  the  radius  r.  Mathematically , however,  the  Abel  transform  and  the 
seismological  technique  are  essentially  the  same.  Therefore,  one  would  expect  to  get  the  same  results.  Why 
don’t  you? 

Wallio:  In  what  1 called  the  Abel  transform  (straight-line  approximation),  which  reduces  to  what  in  most  books  is 

called  Abel’s  integral  equation,  the  general  seismological  case  reduces  to  that.  If  you  have  bending,  however, 
you  are  no  longer  accounting  for  it  in  the  Abel  transform. 

Gross:  That’s  not  true.  Once  you  assume  spherical  symmetry  the  only  limitation  is  the  monotonic  variation  of 

the  product  refractive  index  times  the  radius.  You  can  have  extensive  bending,  but  if  you  violate  this  one 
condition  then  you  have  a limitation  on  the  Abel  transform. 
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ERROR  ANALYSIS  IN  THE  MARINER  6 AND  7 


OCCULT ATION  EXPERIMENTS 
Richard  W.  Stewart  and  Joseph  S.  Hogan 
Institute  for  Space  Studies 

Goddard  Space  Flight  Center,  NASA,  New  York,  New  York 

' ^7  3-  11599  abstract 

The  Mariner  6 and  7 occultation  experiments  provide  sets  of  Doppler  residuals  from  which  physical  properties 
of  the  Mariian  atmosphere  may  be  derived,  such  as  the  ionospheric  electron  density  distribution  and  the  lower 
atmosphere  tempt ature  profile.  The  Doppler  residuals  contain  both  systematic  and  random  errors.  The  former  are 
removed  during  the  data  reduction,  and  the  latter  are  analyzed  to  yield  error  limits  on  the  derived  physical  properties 
of  the  atmosphere. 


INTRODUCTION 


The  Mariner  series  of  planetary  probes  hr.  yielded  valuable  information  on  the  vertical  distribution  of 
temperature  and  electron  density  over  six  points  on  Mars  and  two  points  on  Venus.  An  important  aspect  of  the 
occultation  data  analysis  is  the  determination  of  confidence  limits  for  the  physical  properties  derived  from  the 
Doppler  residual  data.  This  paper  describes  a general  approach  to  the  question  of  deriving  uncertainties  in  quantities 
obtained  from  the  occultation  data  and  applies  this  approach  to  the  Mariner  6 and  7 experiments.  Although  the 
method  is  general,  the  derived  uncertainties  in  physical  quantities  depend  on  the  series  of  mathematical  operations 
performed  on  the  Doppler  residuals  to  obtain  those  quantities  and  the  formulas  given  in  some  instances  in  this  paper 
therefore  cannot  be  used  for  experiments  such  as  the  Mariner  S Venus  occult  ion  where  the  refraction  angle  becomes 
large. 


There  are  two  types  of  errors  involved  in  deriving  physical  characteristics  of  the  Martian  atmosphere  from  the 
occultation  data:  systematic  errors  due  to  effects  such  as  oscillator  drift  and  imperfect  knowledge  of  spacecraft 
trajectory,  and  random  errors  due  to  the  noise  level  of  the  data. 

Systematic  errors  are  manifest  in  nonzero  Doppler  residual  values  at  distances  sufficiently  far  from  the  planet 
that  atmospheric  effects  cannot  be  present.  We  assume  that  these  errors  can  be  removed  from  the  data  by  subtracting 
a bias  curve,  fitted  to  a subset  of  pre-  or  postencounter  points,  from  the  entii.  set  of  Doppler  re:  duals. 

Our  analysis  of  random  errors  in  derived  atmospheric  characteristics  is  based  on  the  following  assumptions: 

1 . Systematic  errors  are  removed  from  the  data  as  stated  above.  Errors  in  derived  quantities  thus  reflect  only 
the  random  errors  inherent  in  the  data. 

2.  Errors  in  derived  quantities  are  produced  only  by  errors  in  the  Doppler  residuals.  Uncertainties  in  other 
parameters  entering  into  the  analysis,  such  as  wavelength  of  the  telemetry  signal  and  spacecraft  position, 
are  either  negligible  compared  to  errors  in  the  residuals  or  are  systematic  and  hence  removed  in  the  data 
reduction. 

3.  The  errors  in  Doppler  residuals  are  statistically  independent. 
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DATA  REDUCTION  AND  ERROR  ANALYSIS 


Removal  of  Svitematic  Error 

Ideally,  in  the  pre-  and  pcstencounter  periods  when  atmospheric  effect',  are  not  present  in  the  telemetry 
signal  received  from  the  spacecraft,  the  Doppler  residuals  should  be  scattered  abom  zero- the  scatter  reflecting 
the  random  noise  in  the  data.  In  practice,  systematic  errors  may  be  present  that  prcduce  nonzero  residuals  in 
regions  far  removed  from  the  planet.  These  systematic  errors  are  removed  from  the  data  by  performing  a poly- 
nomial regression  analysis  on  the  Doppler  residuals  over  selected  time  periods  in  the  preencounter  (entry)  and 
postencounter  (exit)  periods.  The  resultant  least-squares  fitted  bias  curve  is  then  extrapolated  through  the 
encounter  period  and  subtracted  from  the  complete  set  of  Doppler  residuals  to  obtain  a set  of  corrected 
residuals. 

Figure  1 illustrates  this  procedure.  The  dots 
represent  averaged  residual  data  from  three  stations 
obtained  during  Mariner  6 entry;  as  can  be  seen, 
there  is  a large  bias  in  this  particular  set  of  residuals. 

The  curved  line  through  these  raw  residua’s  is  a 
third-order  polynomial  fit  to  a preencounter  set  of 
these  points  and  is  extrapolated  through  the  encounter 
period.  The  corrected  residuals  are  connected  by  the 
solid  line  running  below  the  raw  residuals. 

The  error  involved  in  extrapolating  the  bias 
curve  through  the  encounter  period  cannot  he  formally 
estimated  and  for  this  reason  an  additional  constraint 
must  be  placed  on  the  daia  reduction.  We  assume  that 
there  exists  a region  of  essentially  zero  refractivity  in 
the  middle  atmosphere  of  Mars  where  the  electrons 
are  too  few  to  give  a negative  contribution  to  the 
refractivity  and  the  neutral  atmosphere  too  tenuous 
to  give  a positive  contribution.  Most  analyses  of  the 
corrected  residuals  do  show  a region  of  nearly  constant, 
but  not  necessarily  zero,  refractivity  in  the  middle 
atmosphere;  that  is,  a refractivity  ledge  occurs  in  this 
region.  Refractivity  ledges  that  deviate  from  zero  by 
more  tliati  an  average  of  O.O'I  refractivity  units  are  assumed  to  represent  cases  in  which  the  extrapolation  of  the  bias 
curve  through  the  encounter  period  results  in  large  errors  in  the  "slues  of  refractivity  near  the  planet.  Such  cases 
are  rejected,  a..d  the  procedure  of  fitting  a bias  curve  is  repeated  on  other  sets  of  residual  points  until  a near-zero 
refractivity  ledge  is  obtained. 

The  third -order  polynomial  fit  exhibited  in  figure  1 is  not  a best  fit  in  the  sense  of  minimizing  the  sum  of  the 
squared  deviations  of  the  fitted  points  from  the  bias  curve.  Best  fits  in  this  sense  are  obtained  with  higher  order 
polynomials,  but  these  polynomials  are  erratic  when  extrapolated  outside  the  region  of  fit.  A change  of  a single 
point  in  the  set  selected  for  the  polynomial  fit  can  result  in  radically  different  extrapolated  values  outside  the 
region  of  fit  for  higher  order  polynomials,  but  the  third  order  polynomials  are  fairly  stable  to  such  changes. 

We  must  emphasize  that  this  procedure  foi  correcting  systematic  errors  in  the  data  is  an  assumption  and  that 
the  effects  of  such  errors  on  the  results  cannot  be  rigorously  analyzed.  We  believe  their  effect  can  be  estimated 
however,  by  reducing  data  for  a large  number  of  near-zero  ledft  cases  obtained  after  subtracting  a bias  curve  fitted  to 
several  different  sets  of  residual  points  in  the  pre-  and  postencounter  periods.  Accordingly,  the  results  for  the 
various  atmospheric  profiles  presented  in  this  paper  are  averages  resulting  from  analyses  of  300  near -zero  leuge  cases 
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Figure  1.-  Raw  residuals,  polynomial  fit,  and  corrected  residuals 
\for  Mariner  6 entry. 
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for  each  entry  and  exit.  The  standard  deviations  in  these  average  results,  when  large  enough,  ? re  indicated  in  figures 
by  a shaded  region  about  the  average  profile,  while  typical  standard  deviations  due  to  random  errors  are  indicated 
by  erroi  bats. 


General  Procedure  for  the  Determination  .f  Random  E.ur 

The  polynomial  regression  analysis  described  above  yields  the  variai  ce  o;  the  Doppler  residual  points  to  wh'ch 
the  bias  curve  is  fitted.  Since  the  physical  characteristics  of  the  Martian  atmr  sphere  are  derived  from  these  residuals 
the  initial  error  will  propagate  through  the  calculations  and  will  ultimately  appear  r,s  error  bars  on  the  various 
atmospheric  profiles.  We  denote  the  variance  in  the  Doppler  residuals  at  the  ;th  level  in  the  atmosphere  by  u 
where  the  superscript  D refers  to  Doppler  residuals.  The  variance  in  a derived  quantity  Q at  the  nth  level  r the 
atmosphere  is  then  [Been,  1957] . 


(1) 


where  Qn  is  the  value  of  Q at  the  nth  level  and  is  assumed  to  be  a function  of  the  Doppler  residuals  at  altiiuue  n 
and  at  all  higher  altitudes- that  is,  at  all  levels  / < n. 


It  is  convenient  to  define  two  matrices  M and  S by 


!^U 

«/ 


2£l 

a Ui 


r-(M  ]■(•?) 


(2) 


That  is,  the  ;'th  element  of  the  matrix  to*1'  is  the  derivative  of  Q atauitide  i v ith  respect  to  U at  Ititude  /; 

Q and  U are  any  two  general  quantities  derived  from  'or  equal  to)  the  Doppier  r,si  juals.  The  matrix  is 
obtained  by  .^Hiring  the  mdividi'al  element . of  vfl*-  Sinre  all  derived  quantify  are  obtained  by  dr  inward 
integration  'rom  the  top  of  the  atmosphere,  their  values  at  a given  altituos  do  ur,  depend  on  the  res?  iuals  at  lower 
altitudes.  With  a single  exception  noted  late' , all  the  matrices  used  in  this  analysis  are  therefore  either  lower 
triangular  or  diagonal.  The  specific  methods  used  in  obtaining  the  random  mror  in  the  various  atmospheric  quantities 
deriv'd  from  the  Doppler  residuals  are  enumerated  below. 


Phase  Path 

The  phase  path  is  computed  hv  time  integration  of  the  Donpler  residuals  according  to  the  trapezoidal  rule: 


where  is  the  timu  interval  between  the  residual  measurements  at  times  ty  j and  * j . the  uatux  elements 

involved  in  calculation  of  the  errors  in  the  phase  path  increase  are 
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The  phase  path  variance  is 

o P = S«>  • (5) 

where  is  the  vector  whose  components  are  the  phase  path  variances  at  points  o®  is  the  (constant)  vector  of 
the  residuals  variance,  and  S ^ is  defined  by  F.qs.  (?)  and  (4).  If  we  assume  a constant  A t in  Eq.  (3)  then  it  is 
easy  to  show  that  the  standard  deviation  in  the  phase  points  increases  with  decreasing  altitude  approximately  as 
v/T where  n is  the  number  of  quadrature  points. 

The  magnitude  of  the  errors  in  phase  path  resulting 
front  the  random  error  in  the  Mariner  6 entry  residuals 
in  figure  1 is  shown  by  the  error  bars  in  figure  2.  The 
maximum  standard  deviation  in  phase  path  is  about 
0.63  cycles  (out  of  ~8  cycles)  near  the  ground.  The 
shaded  region  in  figure  2 shows  the  magnitude  of  the 
standard  deviation  in  average  phase  path  (given  by  the 
solid  line)  resulting  from  the  reduction  of  300  cases. 

The  maximum  value  of  this  standard  deviation  is  about 
0.49  cycles  and  occurs  near  the  ground. 


Straight-line  Approximation  to  Phase 

The  straight-line  approximation  to  the  phase  path 
P'  is  \Fjeldbo  and  Eshleman,  1968] . 

■ F>  ’1 4 «" 

where  Z,-  is  the  distance  of  the  spacecraft  behind  the  planet,  X is  the  wavelength  of  the  telemetry  signal,  and  a,- 
is  the  angle  through  which  the  ray  is  bent  in  traversing  the  atmosphere.  As  before,  the  subscript  i denotes  the  r'th 
altitude.  The  variance  in  P1  is  given  by 

</’  = S P'D  . 00  ( 
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Figure  2.-  Avenge  phase  path  for  Mariner  6 entry  obtained 
from  three  hundred  reductions  of  Doppler  residual  data, 
using  different  bias  curves.  The  standard  deviation  in  the 
average  is  indicted  by  the  shaded  area.  The  error  bars  in- 
dicate the  magnitude  of  'he  standard  deviation  'ue  to 
random  error  derived  in  any  single  reduction  of  the  data. 
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where  a p is  the  vector  whose  components  are  the  variances  of  p'  at  levels  i.  The  matrix  ® is  computed 
using  (2)  once  is  computed  from  the  equation 


M P D = • Mpd  + a • M0^ 


The  matrices  Mp  p and  a are  calculated  from  the  definition  (2)  using  Eq.  (6)  to  relate  P‘  and  P and  a. 
has  already  been  computed  in  deriving  the  phase  path  errors,  and  can  be  computed  from  the  relation 


where  l)  is  the  component  of  the  spacecraft  velocity  along  the  perpendicular  to  the  Earth-Mars  line  at  altitude  i. 

The  matrix  p in  the  first  term  on  the  right-h3nd  side  of  (8)  is  the  unit  matrix  while  both  a and  M01®  are 
dngonal  and  hence  their  product  is  diagonal.  The  matrix  & then  differs  from  only  in  the  diagonal  elements 
which  are  given  by 

= — At-,  , + i>  1 (10) 


The  erors  in  P’  are  only  slightly  greater  than  those  in  P and  will  net  be  shown  explicitly. 


The  variance  in  refractivity  is  given  by 


Refractivity 


gR  = $RD  . gD 


The  refractivities  are  not  given  explicitly  as  functions  of  the  Doppler  residuals,  but  are  given  in  terms  of  the  straight- 
line  approximation  to  the  phase  path  by  [Fjeldbocnd  Eshleman,  1968] . 


106  \P' 


|l06  X-P-  ~ 2 zj^AZ,7] 


where  AZjj  is  one-half  the  distance  'raversed  by  the  ith  ray  in  the  /th  layer.  The  matrix  required  for  the 
computation  of  S ^ may  there!  ,.tK  obtained  from 
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(13) 


where  M P’D  has  already  been  computed  in  the  analysis  of  P'  errors  and  can  be  calculated  from  the  recursion 

formulae  (12).  If  we  define  a matrix  Z by 
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the  the  matrix  is  computed  from 
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The  standard  deviations  in  refractivity  are  shown  by 
the  error  bars  in  figure  3 for  the  Mariner  6 entry 
case.  These  standard  deviations,  like  those  in  phase, 
increase  slowly  with  decreasing  altitude.  The  maxi- 
mum refractivity  ertor  occurs  near  the  ground  and 
has  a magnitude  of  about  0.068  refractivity  units. 
Near  Tie  electron  density  peak  at  136  km,  the 
standard  deviation  in  refractivity  is  about  0.046. 

The  standard  deviation  in  average  refractivity 
resulting  from  the  reduction  of  300  cases  cannot  be 
readily  shown  on  the  scale  of  figure  3.  Near  the 
ground,  this  standard  deviation  in  average  refractivity 
is  0.039  units  and  decreases  to  0.01 6 units  near  the 
electron  density  maximum. 


Electron  Density 

The  basic  equation  for  the  variance  in  electron 
density  is 


5 
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MARINER  6 ENTRY 


Figure  3.-  Avenge  refractivity  for  Mariner  6 entry  obtained 
from  three  hundred  reductions  of  Doppler  residual  data 
using  different  bias  curves.  The  standard  deviation  in  the 
average  is  too  small  to  be  shown  on  this  scale.  The  error 
ban  indicate  the  magnitude  of  the  standard  deviation  due 
to  random  error  derived  in  any  single  reduction  of  the  data. 


aN  = SND  . aD  (17) 

The  matrix  required  for  the  computation  of  is  obtained  from 


mM>  = mnr  • mrd 


(18) 
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We  computed  iu  the  analysis  of  the  refractivity  errors,  and  is  a multiple  of  the  unit  matrix,  which 

is  readily  calculated  from  the  relationship  between  electron  density  and  refractivity 
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-J Hi , 

4C3X1012 


(cm  3) 


(19) 


where  ] is  the  frequency  of  the  telemetry  signal  in  Hz.  The  matrix  is  thus 


/ 1 
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(20) 


where  6 ■ ■ is  the  Kronecker  delta.  The  standard 
deviation  in  electron  density  for  Mariners  6 and  7 
are  shown  by  the  error  bars  in  figures  4 and  5 . Since 
the  electron  density  is  proportional  to  refra'tivity 
these  standard  deviations  increase  slowly  with 
decreasing  altitude.  The  shaded  areas  in  figures  4 and 
* give  the  standard  deviation  in  average  electron 
density  resulting  from  300  reductions  of  the  data. 


Temperature 

The  temperature  variance  is  given  by 


Figure  4.-  Average  election  density  for  Mariner  6 entry  obtained 
from  three  hundred  reductions  of  Doppler  residual  data  using 
different  bias  curves.  The  standard  deviation  in  the  average  is 
indicated  by  the  shaded  area.  The  error  bars  indicate  the 
magnitude  of  the  standard  deviation  due  to  random  error 
derived  in  any  single  reduction  of  the  data 
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TD 

The  matrix  W required  for  the  computation  of 
STD  is  obtained  from 


Figure  5.-  Same  as  figure  4 for  Mariner  7 entry. 


MW  = MTR  • (22) 


where  ha  s been  calculated  in  the  analysis  of 

the  refractivity  errors,  and  is  calculated  from 
the  equations  relating  temperature  to  refractivity. 
Since  the  temperature  at  *he  top  of  the  lower 
atmosphere,  say  Tt , is  computed  from  the  scale 
height,  determined  by  the  two  uppermost  refractivity 
points  (the  two  points  just  below  me  ledge),  the 
matrix  M TR  lias  a single  nonzero  element  above 
the  diagonal  and  is  the  exception  noted  previously 
to  the  set  of  diagonal  and  lower  triangular  matrices. 
The  matrix  elements  at  the  initial  altitude  are 
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where  m is  the  assumecl  mean  molecular  mass,  f is  the  average  acceleration  of  gravity  in  the  uppeimost  layer 
A/il3,  and  k is  Boltzmann’s  constant.  The  temperature  distribution  in  the  lower  atmosphere  is  related  to 
refractivity  by  [Fjeldbo  and  Eshleman,  1968], 


R(iio)  1 

T(h)  = T{ho) + 

R(h)  kR(h) 


gtt)M(l)R($)  d$ 


ho  refers  to  the  initial  altitude  and  h to  some  altitude  below  ho.  If  we  use  the  trapezoidal  rule  to  integrate 
Eq.  (25)  then  the  diagonal  matrix  elements  of  are 
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Ah , j Ah,_i  ; 

x = (giRi  + •••  + te,--i*i-i  *gfRt)  - 2 1 


and  the  off-diagonal  elements  are 


-^HAh/..i/.+^Ah//+i] 


The  standard  deviations  in  temperature  are  shown  by  the  error  bars  in  figures  6 and  7.  The  shaded  areas  in  figures 
6 and  7 give  the  standard  deviation  in  average  temperature  resulting  from  300  data  reductions. 
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I igure  6.-  Average  dayside  temperature  profiles  for  Mariner  6 
and  7 obtained  from  three  hundred  reductions  of  Doppler 
residual  data  using  different  bias  curves.  The  standard  deviations 
in  these  averages  ate  indicated  ty  the  shaded  areas  where  it  is 
large  enough  to  be  clearly  shown.  The  error  bars  indicate  the 
magnitudes  of  the  standard  deviations  due  to  random  errors 
derived  in  any  single  reduction  of  the  data. 


Figure  7 - Same  as  figure  6 for  the  Manner  6 and  7 nightside 
temperatures. 


DISCUSSION 


The  analysis  of  the  propagation  of  random  error  in  the  Doppler  residuals  through  the  data  reduction  procedure 
shows  that  physical  properties  of  the  Martian  atmosphere  such  as  the  vertical  distributions  of  electron  density  and 
lower  atmosphere  temperature  can  be  determined  with  good  accuracy.  The  standard  deviation  shown  by  the  error 
bars  in  figures  2 through  7 are  properly  regarded  as  lower  limits  to  the  actual  error  present  in  these  quantities  since 
the  effect  of  systematic  errors  cannot  be  rigorously  analyzed.  We  have  estimated  the  effect  of  systematic  errors  by 
analyzing  a large  number  of  near-zero  ledge  cases  obtained  after  subtracting  a bias  curve  fitted  to  several  different 
sets  of  residual  points  in  the  pre-  and  postencounter  periods. 

Generally,  that  standard  deviation  in  the  average  profile  (or  these  cases  is  less  than  the  standard  deviation  in 
any  one  profile  due  to  random  errors  in  the  residuals.  The  bias  removal  technique  employed  in  the  data  reduction 
thus  gives  reliable  results  as  long  as  the  constraint  of  a small  refractivity  ledge  in  the  middle  atmosphere  is  imposed 
and  it  appears  that  the  effect  of  systematic  errors  can  be  nearly  eliminated  by  subtracting  a least  squares  fitted  bias 
curve  from  the  residual  data.  We  cannot  prove  however,  within  the  confines  of  the  occultation  experiment  itself, 
that  our  results  are  valid  when  systematic  errors  are  present,  out  if  they  are  assumed  valid  then  the  random  error 
limits  should  provide  a realistic  estimate  of  the  actual  uncertainty  in  physical  properties  derived  from  the  Doppler 
residuals. 


ACKNOWLEDGMENTS 


The  authors  wish  to  extend  their  thanks  to  Dr.  A.  Kiiore  for  providing  both  the  Doppler  residual  data  and 
several  useful  discussions  regarding  data-reduction  procedures.  Dr.  L.  Russell  was  of  assistance  in  programming 
the  calculations. 


3-58 


DISCUSSION 


Kliore:  This  is  an  excellent  piece  of  work.  However,  another  source  of  systematic  error  is  the  fact  that  you  have 

to  choose  where  to  remove  the  refractivity  bias  once  you  have  obtained  the  refractivity  curve.  1 found  that 
that  does  make  a significant  difference  in  the  surface  conditions  of  pressure  ar.d  temperature.  But  the  temp- 
eratures are  still  within  your  error  bars. 

Stewart:  In  this  procedure,  if  you  insist  on  getting  a zero  ledge  in  the  first  place  then  we  really  have  to  remove 

a refractivity  bias  before  you  analyze  the  data.  Of  course,  you  never  get  to  zero,  but  you  can  get  down  to 
KT4. 


ERROR  ANALYSIS  OF  BENT-RAY  RaDIO  OCCULT ATION  MEASUREMENTS 

R.  L.  St.  Germain 


Raytheon  Company,  Sudbury,  Massachusetts 


f N73- 


11600 


The  radio  occultation  method  of  studying  planetary  atmospheres  has  been  u.;ed  successfully  to  study  the 
atmospheres  of  several  planets.  Radial  refractivity  profiles  of  the  Martian  atmosphere  have  been  achieved  by 
Mariners  4. 6 and  7 and  of  the  Venusian  atmosphere  by  Mariner  5 \Fjeldbo  and  Eshleman , 1968,  Fjeldbo  et  al. , 

1970:  Kliore  et  al. , 1965;  i\l i ore  et  al. , 1970] . The  Mariner  ’7 1 orbiter  is  expected  to  provide  up  to  several  score 
of  occulation  data  at  various  points  of  the  Martian  surface,  and  the  Viking  ’75  dual  orbiter  to  Mars  will  upgrade  the 
spatial  coverage  and  measurement  accuracy  even  more. 

There  are  two  types  of  experimental  measurement  errors  of  the  Doppler  data  associated  with  the  radio 
occultation.  random  and  systematic.  Random  errors  are  due  to  thermal  noise  in  the  transmission  channel,  and 
the  phase  lock  loop,  and  quantization  error  in  the  digital  circuitry.  These  are  called  noise  type  errors.  The  system- 
atic errors  are  due  to  geometric  uncertainty  and  equipment  phase  instability.  It  is  assumed  that  a sufficient  correction 
may  be  made  to  render  the  systematic  errors  small  in  comparison  to  the  random  errors. 

This  paper  concerns  the  amount  of  uncertainty,  due  to  random  measurement  errors,  in  the  refractivity  profiles 
reconstructed  by  this  type  of  indirect  sensing  experiment.  A class  of  refractivity  profiles  is  defined  which  approxi- 
mately (within  a certain  uncertainty)  fit  the  set  of  measured  data.  Bounds  are  placed  on  the  extent  of  this  class  of 
solution  profiles.  To  accomplish  this,  we  must  examine  (1)  the  sensitivity  of  the  reconstructed  refractivity  profiles 
to  errors  in  the  measured  quantity,  that  is,  the  stability  of  the  inversion  process  to  measurement  errors;  and  (2)  the 
statistics  of  the  errors  in  the  measurement. 


We  present  first  a brief  outline  of  the  direct  problem.  Then  we  introduce  a linearized  Taylor  expansion  about 
a given  atmospheric  model  to  determine  the  response  of  the  inversion  operator  to  perturbations  of  the  Doppler  data. 
Numerical  calculations  of  the  rms  refractivity  error  (standard  deviation)  are  presented  for  typical  planetary  atmospheres. 
A region  of  confidence  of  the  reconstructed  profiles  is  related  to  the  standard  deviation  of  the  Doppler  error. 

In  the  radio  occultation  experiment,  the  ray  between  the  spacecraft  and  the  earth  tracking  station  is  refracted 
and  retarded  as  the  ray  path  intersects  the  planetary  atmosphere.  This  change  in  phase-path  length  introduces  a 
Doppler  shift.  With  the  positions  of  the  spacecraft,  planet  under  study,  and  earth  known,  the  geometric  Doppler  due 
to  their  relative  motion  may  be  subtracted  from  the  measured  Doppler.  The  medium-induced  Doppler  frequency 
residual  is  the  input  to  an  inversion  scheme  that  computes,  as  the  end  product,  a radial  refractivity  profile.  The  direct 
problem  involves  integrating  the  differential  phase  path  length  over  the  ray  path.  The  resulting  phase  shift  (in  cycles) 
is 


<Hp) 


O) 


where 

k0  = f/c 

rQ  * radius  of  maximum  extent  of  the  planetary  atmosphere 
rp  - closest  approach  radius  of  the  ray  path 
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The  ray  parameter  is  a characteristic  of  each  ray  and 
is  defined  as 


p - nr  sin  / 


t}  = nr 

n = index  of  refraction 
r - planetocentric  radius 
i = angle  defined  in  figure  1 . 


In  all  practical  situations,  only  a finite  amount 
of  data  exists,  and  therefore  the  Doppler  record  is 
uniquely  determined  by  its  projection  onto  a finite 
dimensional  space.  Consider  a metric  space  D 
containing  all  possible  Doppler  residuals  and  another 
metric  space  R containing  all  possible  radial  refrac- 
tivity  profiles.  Every  Doppler  record  and  refractivity 
profile  is  represented  by  a vector  in  the  corresponding 
space.  Equation  (1)  defines  a transformation  operator 
that  relates  each  atmospheric  model  vector  to  a Doppler 
record  vector  (fig.  2);  that  is, 


4>  o( n) 


(2) 


Figure  1.-  Ray  geometry. 


All  real  data  are  contaminated  by  measurement 
errors.  We  define  the  error  vectors  in  the  Doppler 

phase  residual  and  model  parameter  spaces  by  indirectly  sensed  measured  quantity 

QUANTITY 


£0  = <*>-*o  en  5 n~no 


(3) 


STRUCTURE  (MODEL) 
PARAMETER  SPACE 


DOPPLER  PARAMETER 
SPACE 


where  <t>  and  n are  the  measured  vectors  in  the 
Doppler  phase  and  refractivity  model  spaces,  respectively. 
The  subscript  o represents  the  true  (actual)  quantities. 
Note  that 


<J>0  = o(n0)  and  <t>  = o(n)  (4) 


We  may  expand  each  component  of  the  refrac- 
tivity profile  vector  n by  Taylor  s formula  for  a 
scaler-valued  function  of  several  variables.  Grouping 
the  terms  in  matrix  form  and  applying  Eq.  (3)  yields 
a multidimensional  Taylor  series  (expansion  of  a 


Figure  2.-  Transformation  between  the  atmosphere’s  model 
vector  and  Doppler  record  vector. 


vector-valued  function  of  several  variables)  about  a given  atmospheric  refractivity  mode!: 


en  = K!  + •••  higher  order  terms 


(5) 


where 


Ka  - 


brtj 

d<t> 


(6) 


The  response  to  perturbation  (errors)  can  therefore  be  linearized  about  <P0  by  neglecting  quadratic  and  high  order 
terms  because  is  small.  The  linear  approximation  to  the  perturbation  response  is,  from  F.qs.  (3)  and  (5) 


~ K * e<j, 


(7) 


The  model  parameter  error  vector  is  “proportional  to”  the  Doppler  phase  error  vector  where  the  constant  of  proportion- 
ality is  dependent  on  the  given  atmospheric  model.  From  Eq.  (7)  it  can  be  shown  that  the  variances  of  the  refractivity 
and  Doppler  phase  vectors  are  linearly  related  by  the  factor  K,?.  when  the  Doppler  phase  error  components  are  assumed 
to  be  statistically  independent: 

Dw  - K • KJ  = (Kfj)  (8) 

where  bn  and  D are  the  variances  of  the  refractivity  and  phase  vector. 

The  K matrix  represents  the  incremental  response  of  the  rth  refracticity  profile  component  to  an  incremental 
change  in  ihe  /th  Doppler  phase  component.  The  columns  of  the  K matrix  represent  the  directional  derivatives  of 
the  inverse  function  o_1  (assuming  a unique  inverse  exists).  From  Eq.  (4)  and  the  definition  of  the  directional 
derivative 


o"‘  (<t>+/i4>.)  - o-1  (4>) 

lint  — — 

h~*o  h 


may  be  used  to  evaluate  the  K matrix  directly.  When  the  refractivity  of  the  planetary  atmosphere  is  small,  it  may  be 
shown  that  the  operator  o is  linear.  In  matrix  form: 


4>  = d • n 


Taking  the  inverse 


n = a-1  • 4> 


from  this  form  we  see  that 


K - <T‘ 
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If  we  examine  the  operator  o apply  the  definition  of  p and  tj,  and  discretize  the  integrand  of  Eq.  (1)  by  assuming 
spherically  stratified  concentric  shells,  we  obtain  the  summation 


4>y  = £ n sec  ^ A r .''l 

where  = angle  i at  the  Arth  shell  and  the  /th  ray,  and  A r i,  the  shell  thickness,  and  see  is  known  for  the 
/ th  ray  path.  Taking  the  inverse  of  the  matrix  form  of  Eq.  (9)  results  in  the  matrix  inverse  operator  o~l. 

We  next  turn  our  attention  to  relating  the  variance  vector  of  the  received  Doppler  phase  record  to  known 
quantities.  We  know  the  Doppler  phase  is  the  time  integral  of  the  Doppler  frequency.  The  Doppler  frequency  shift 
is  sampled  every  Ar  sec,  yielding 


*i  - 


Z /».4' 


m=  1 


where  / = ,7ith  Doppler  frequency  sample.  Assuming  the  Doppler  frequency  shift  errors  are  statistically 

independent,  the  variance  of  the  sum  is  equal  tr  the  sum  of  the  variances. 


1 00 
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(10) 


where  Dj  is  the  variance  of  the  Doppler  frequency  shift.  When  we  assume  the  mean  frequency  error  is  zero,  the 
Doppler  frequency  shift  variance  is  equal  to  the  mean  square  Doppler  frequency  measurement  noise: 


Dj*  = Dj-2  = rms  frequency  noise  i ~ 1,2, 3, ...  (11) 

Using  Eqs.  (8),  (9),  and  (10),  the  standard  deviation  of  the  N unit  refractivity  divided  by  the 
Dj- 2 [/V  unit  = (n-l)XlO4]  is  plotted  in  figure  3 assuming  200  1 -second  samples.  The  3o(±3o)  uncertainty 
region  about  a given  model  (Mars)  is  shown  in  figure  4 for  an  rms  noise  error  of  0.10  Hz  (approximate  value  for 
Mariner  4). 

An  uncertainty  in  the  measured  quantity  space  has  been  transformed  to  an  uncertainty  in  the  indirectly 
sensed  quantity  space  by  linearizing  the  response  to  small  measurement  errors.  A region  (volume)  of  confidence 
in  the  Doppler  vector  space  is  defined  in  the  conventional  manner  and  is  assumed  known.  The  corresponding 
region  (volume)  in  the  indirectly  sensed  quantity  space  is  the  confidence  region  of  the  model  space.  The 
uncertainty  region  has  been  plotted  for  several  planetary  atmospheres  with  a physically  realistic  assumption 
of  the  nature  of  the  Doppler  frequency  error.  This  is  intended  to  yield  a quantitative  appreciation  for  the 
deviation  of  the  solution  model  caused  by  realistic  values  of  random  additive  noise  in  the  Doppler  data. 
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AN  OCCULT AT10N  SATELLITE  SYSTEM  FOR  DETERMINING 
PRESSURE  LEVELS  IN  THE  ATMOSPHERE 
Steven  G.  Ungar  and  Bruce  B.  Lusignan,  Stanford  U liversity 

1 1 ^ ^ ABSTRACT 


An  operational  t -vo-satellite  microwave  occultation  system  will  establish  a pressure  reference  level  to  be  used  in 
fixing  the  temperature-pressure  profile  generated  by  the  c RS  infrared  sensor  as  a function  of  altitude.  In  the  final  error 
analysis,  simulated  data  for  the  SIRS  sensor  were  used  to  >est  the  performance  of  the  occultation  system. 

The  results  of  this  analysis  indicate  that  the  occultation  system  is  capable  of  measuring  the  altitude  of  the  300-mb 
level  to  within  about  24  mrms,  given  a maximum  error  of  2°K  in  the  input  temperature  profile.  The  effects  of  water  vapor 
can  be  corrected  by  suitable  climatological  profiles,  and  improvements  in  the  accuracy  of  the  SIRS  instrument  should 
yield  additional  improvements  in  the  performance  of  the  occultation  system. 

The  final  stage  of  the  error  analysis  for  the  two-satellite  microwave  occultation  pressure  reference  level  s)  stem  has 
been  completed.  In  this  stage,  simulated  data  for  the  SIRS  infrared  (IR)  sensor  were  used  to  test  the  performance  of  the 
occultation  system.  The  effect  of  the  oblateness  of  the  earth  was  included  in  the  analysis,  as  was  the  effect  of  water  vapor. 

The  results  of  this  analysis  are  in  agreement  with  previous  results;  that  is,  the  occultation  system  is  capable  of  measuring  the 
altitude  of  the  300-mb  level  to  within  about  24  mrms,  given  a maximum  error  of  2“K  in  the  tempcratuic  profi1*  used  as  input 
for  the  system. 

The  simulated  SIRS  profiles  were  provider!  by  Dr.  William  Smith  of  NOAA,  and  consist  of  206  profiles  of  temperature 
versus  pressure,  with  temperature  given  at  00  pressure  levels  between  1000-mb  and  0.1  -mb.  A typical  data  set  is  shown 
in  figure  1 . Each  level  gives  a simulated  temperature  “sensed"  by  the  IR  instrument,  a temperature  “error”  (how  this  “sensed” 
temperature  differs  from  the  true  temperature),  and  the  error  in  height  resulting  from  the  accumulated  error  in  temperature. 

In  addition,  each  set  tabulates  rms  errou  for  various  ranges  of  pressure  (e.g  . 700  to  400-mb,  400  to  100-mb). 

Half  of  the  data  sets  give  simulated  profiles  for  a seven  channel  rensor,  the  other  half  for  a nine  channel  sensor.  Only 
data  sets  for  the  nine  channel  sensor  were  used  in  this  error  analysis.  The  data  sets  are  divided  into  three  groups  labeled 
set  2,  set  3,  and  set  4 according  to  the  latitude  of  the  stations  in  the  set  These  latitude  ranges  are  indicated  in  table  1 . 


1 iie  pressure-reference  level  program  was  run  as  follows.  The  "sensed" 
temperature  profile  and  the  corresponding  temperature  “errors”  were  com- 
bined to  give  the  “true”  temperature  profile.  This  profile  was  used  as  the 
“reference  atmosphere.” 


TABLE  1-  LATITUDE  RANGES 
Set  No. 

Let.  2 3 4 

Min  25  N 38  N 51  N 
Max  37  N 51  N 76  N 


The  original  “sensed”  temperature  profile  was  used  as  "measured  data.”  An  atmosphere  was  constructed  from  this 
profile  and  shifted  vertically  untii  the  results  of  ray  tracing  through  the  vertically  shifted  "measured”  profile  matched  the 
results  of  ray  tracing  through  the  “reference  atmosphere.”  In  both  the  “reference  atmosphere”  and  the  “measured  atmo- 
sphere” portions  of  the  program,  a random  error  of  absolute  magnitude  0.5  m was  added  to  the  pnase  defect  to  simulate 
the  uncertainty  in  satellite  spacing. 

The  resulting  altitude  error  at  each  of  nine  pressure  levels  was  recorded.  A typical  output  sample  is  shown  in 
figure  2.  Statistical  summaries  for  each  of  the  three  latitude  ranges  are  given  in  tables  2 through  7 Tables  2 through 
4 give  the  results  for  a satellite  aeparation  of  7863  km,  while  tables  5 through  7 give  the  results  for  a satellite 
separation  of  7888  km.  The  greater  the  satellite  separation,  the  lower  the  closest  approach  altitude  of  the  ray. 

Table  8 lists  the  closest  approach  altitudes  for  each  of  the  cases  shown  in  table  2 through  7.  The  results  are  alzo 
shown  in  figures  3 and  4. 

A smallei  number  of  profiles  were  run  with  water  vapor  introduced  into  the  reference  atmosphere  portion  of  the 
program.  The  water  vapor  information  consisted  of  the  associated  rawinsonde  relative  humidity  profiles.  A simplistic 
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Figure  1.-  Typical  input  data  aet. 
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Figure  2.-  Typical  output  of  preuure  reference  ryitem. 
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TABLE  2 - STATISTICS  FOR  14  STATIONS 
SATELLITE  SEPARATION  7863.0000  km  - SET  2 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

7.70 

14.05 

16.02 

-16.89 

28.33 

400.000 

8.18 

16.42 

18.35 

-21.21 

30.48 

300.000 

17.64 

28.40 

33.43 

-30.50 

53.93 

200.000 

24.65 

35.57 

43.28 

-25.81 

92.03 

140.000 

2.64 

33.55 

33.65 

-55.26 

61.61 

100.000 

0.57 

27.10 

27.10 

-45.36 

58.87 

70.000 

4.56 

28.17 

28.54 

-42.50 

56.64 

50.000 

11.02 

38.20 

39.76 

-58.88 

91.92 

.30.000 

10.55 

37.38 

38.84 

-49.27 

58.26 

TABLE  3.-  STATISTICS  FOR  20  STATIONS 
SATELLITE  SEPARATION  7863.0000  km  - SET  3 

Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

1.21 

17.67 

17.71 

-20.61 

36.88 

400.000 

-3.15 

12.97 

13.35 

-24.05 

17.89 

300.000 

-0.07 

16.47 

16.47 

-43.97 

21.99 

200.000 

6.28 

25.77 

26.52 

-51.67 

41. .5 

140.000 

4.90 

42.22 

42.5C 

-83.59 

80.43 

100.000 

-2.32 

39.53 

39.59 

-73.74 

95.77 

70.000 

1.13 

28.35 

28.38 

-33.48 

87.17 

50.000 

9.72 

24.72 

26.57 

-28.99 

77.82 

30.000 

24.42 

40.21 

47.05 

-47.99 

87.49 

TABLE  4 - STATISTICS  FOR  18  STATIONS 
SATELLITE  SEPARATION  7863.0000  km  - SET 4 

Level  (mb) 

Mean 

SD 

RMS 

M'r» 

Max 

500.000 

12.38 

22.83 

25.97 

-;-7.09 

48.48 

400.000 

9.56 

25.29 

27.04 

—3 1 .75 

42.22 

300.000 

-0.82 

25.32 

25.33 

-49.44 

38.81 

200.000 

12.11 

22.26 

25.34 

-12.92 

72.49 

140.000 

14.80 

29.62 

33.11 

-21.11 

95.06 

100.000 

15.39 

33.36 

36.74 

-38.29 

105.33 

70.000 

11.87 

29.70 

31.99 

-41.17 

76.05 

50.000 

1.35 

19.38 

19.42 

-31.85 

30.24 

30.000 

-21.75 

36.20 

42.23 

-78.40 

43.18 

3-68 


TABLE  5.-  STATISTICS  FOR  14  STATIONS 
SATELLITE  SEPARATION  7888.0000  km  - SET  2 


Level  (mb) 

Mean 

SP 

RMS 

Min 

Max 

500.000 

2.72 

16.82 

17.04 

-32.33 

30.76 

400.000 

3.20 

1742 

17.71 

-28.07 

28.5C 

300.000 

12.66 

15.84 

20.27 

-14.57 

34.83 

200.000 

19.67 

22.48 

29.87 

-13.67 

59.76 

140.000 

-2.34 

32.35 

32.44 

-47.95 

48.25 

100.000 

-4.41 

28.62 

28.96 

-62.93 

42.37 

70.000 

-0.42 

23.34 

23.34 

-47.52 

32.24 

50.000 

6.04 

29.24 

29.86 

-32  20 

68.65 

30.000 

5.57 

25.40 

26.00 

-44.91 

44.65 

TABLE 6.-  STATISTICS  FOR  19  STATIONS 
SATELLITE  SEPARATION  7888.0000  km  - SET  3 

Leve’  (mb; 

Mean 

SD 

RMS 

Min 

Max 

500.000 

-3.31 

19.62 

19.90 

-43.18 

27.33 

400.000 

-7.95 

28.79 

29.87 

-68.50 

41.94 

300.000 

-5.63 

45.71 

46.06 

-96.07 

69.94 

2UO.OOO 

-0.87 

43.21 

43.22 

-80.68 

71  70 

140.000 

-0.31 

38.60 

38.60 

-68.58 

68  34 

100.000 

-6.19 

37.56 

38.12 

-60.80 

7"  .5 1 

70.000 

-2.94 

30.74 

30.88 

-51.80 

72.04 

50  000 

4.29 

25.10 

25.47 

-33.07 

51.54 

30.000 

15.71 

37.80 

40.93 

-47.12 

89.89 

TABLE  7.-  STATISTICS  FOR  18  STATIONS 
SATELLITE  SEPARATION  7888.0000  km  - SET  4 

Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

14.07 

35.69 

38.36 

-29.69 

90.63 

400.000 

11.25 

28.34 

30.49 

-17.20 

76.93 

300.000 

0.87 

27.49 

27.51 

-40.40 

56.26 

200.000 

13.81 

49.56 

51.45 

-56.19 

103.93 

140.000 

16.49 

58.22 

60.51 

-60.69 

133.87 

'00.C30 

17.08 

54.47 

57.08 

-40.43 

140.75 

70.000 

13.56 

50.62 

52.40 

-47.56 

127.72 

50.000 

3.04 

49.95 

50.04 

-79.01 

109.18 

30.000 

-20.06 

63.61 

66.70 

-134.16 

83.86 

TABLE  8 - DR>  ATMOSPHERE 
Sat  Sep 


7863  ! 

! 7888 

Set 

No.  Prof 

Min.  alt. 

Max.  alt. 

No.  Prof 

Min.  alt. 

Max.  alt. 

2 

14 

5.78 

7.86 

14 

3.59 

5.44 

3 

20 

7.55 

9.75 

19 

4.85 

6.81 

4 

18 

9.59 

12.56 

18 

7.29 

9.91 

3 -9 
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Figt  * 3.-  Standard  deviation  and  rmi  error  versus  pressure. 
Ty  atmosphere  satellite  separation  7863.0  km. 
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Figure  4.-  Standard  d ...tion  and  rms  error  versus  pressure. 
Dray  atmo^nere  satellite  separation  7888.0  km. 


water  vapor  correction  was  used  in  the  second  half 
(“measured  data”  portion)  of  the  program.  This 
simplistic  correction  consisted  in  each  case  of  two 
humidity  profiles,  representing  either  a “high”  value 
of  water  vapor  or  a “low”  value  (see  fig.  5).  The 
appropriate  corrective  profile  was  chosen  based  on 
the  surface  value  of  the  rawinsonde  water  vapor 
profile.  All  the  data  sets  from  sets  3 and  4,  and 
one  of  the  profiles  from  set  2,  used  the  lower 
corrective  profile;  the  remaining  profiles  in  set  2 
used  the  higher  corrective  profile.  The  results  of 
this  run  are  summarized  in  tables  9 through  14. 
Tables  9 through  1 1 give  the  results  for  a satellite 
separation  of  7S63  km.  and  tables  12  through  14 
give  the  results  for  a satellite  separation  of  7888  km. 
The  corresponding  closest  approach  altitudes  are 
listed  in  table  15.  The  results  are  shown  in  figures 
6 and  7. 


Figure  6.-  Standard  deviation  and  rms  error  versus  pressure. 
Wet  atmosphere  with  simplistic  climatic  correction  satellite 
separation  7863.0  km. 


Profile*  adapted  iron  Figure  6,  Hutcherson,  D.C. , "Vittr  Vapor  in 
the  Atnoaphere,"  Huwudity  and  Mol* tore.  Vol.  2,  1969,  pp.  4*6-494. 

Th*  above  profile*  were  extrapolated  to  29  k»  by  u*e  of  the  equation 
e(x)  - :r)  exp [0.9476  • where  e it  the  water  vapor  pressure, 

in  *tl)U  r» t and  t la  th#  altitude,  in  klloawtera. 


Figure  6.-  Water  vapor  profiles. 
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Figure  7.-  Standard  deviation  and  rms  error  versus  pressure. 
Wet  atmosphere  with  simplistic  climatic  correction  satellite 
separation  7888.0  km. 
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TABLE  9 - STATISTICS  FOR  6 STATIONS 
SATELLITE  SEPARATION  7863.0000  km  - SET  2 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

50  000 

13.12 

16.80 

21.31 

-7.95 

40.43 

400.000 

8.31 

20  ?i 

21.85 

-23.56 

29.32 

300.000 

15.22 

26.48 

30.54 

-27.64 

40.54 

200.000 

34.51 

34.32 

48.67 

-2.97 

93.47 

140.000 

32.68 

47.46 

57.63 

-18.32 

80.53 

100.000 

19.16 

31.17 

36.59 

-26.20 

59.18 

70.000 

6.13 

43.39 

43.82 

-56.23 

46.92 

50.000 

11.06 

59.10 

60.13 

-52.70 

83.33 

30.000 

15.58 

26.30 

30.57 

-19.93 

46.52 

TABLE  10.-  STATISTICS  FOR  8 STATIONS 
SATELLITE  SEPARATION  7863.0000  km  SET  3 

Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

39.18 

33.84 

51.77 

-5.46 

99.36 

400.000 

30.87 

30.54 

43.42 

-1.78 

88.95 

300.000 

21.53 

29.34 

36.39 

-16.31 

68.32 

200.000 

38.97 

38.62 

54.86 

-3.64 

94.15 

140.000 

50.94 

48.82 

70.56 

-22.24 

121.11 

100.000 

44.02 

46.73 

64.20 

-35.13 

105.45 

70.000 

38.30 

34.71 

51.69 

-13.50 

88.64 

50.000 

32.68 

31.24 

45.21 

-21.36 

70.95 

30.000 

43.89 

40.11 

59.45 

-33.54 

85.94 

TABLE  11.-  STATISTICS  FOR  7 STATIONS 
SATELLITE  SEPARATION  7863.0000  km  - SET 4 

Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

25.19 

27.94 

37.62 

-20.77 

66.63 

400.000 

23.62 

25.98 

35.12 

-26.80 

52.06 

300.000 

9.41 

8.46 

12.66 

-3.93 

18.65 

200.000 

13.59 

14.00 

19.51 

-7.75 

35.83 

140.000 

17.46 

21.65 

27.81 

-13.57 

41  90 

100.000 

25.93 

25.36 

36.27 

-23.61 

56.55 

70.000 

29.48 

26.43 

39.55 

-20.16 

69.11 

50.000 

14.41 

24.78 

28.66 

-19.81 

43.91 

30.000 

-30.30 

47.90 

56.68 

-91.29 

39.23 
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TABLE  12.  - STATISTICS  FOR  6 STATIONS 
SATELLITE  SEPARATION  7888.0000  km  - SET  2 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Mav 

500.000 

-53.11 

37  21 

64.85 

-97.58 

8.24 

400.000 

-57.93 

41.29 

71.14 

-122.42 

-2.86 

300.000 

-51.02 

35.27 

68.21 

-126  50 

0.03 

200.000 

-31.72 

4/98 

57.52 

-68.77 

61.28 

140.000 

-33.55 

49.06 

59.44 

-39.27 

43.35 

100.000 

-47.07 

37.68 

60.29 

-77.58 

27.00 

70.000 

-60.11 

60.83 

85,52 

-142.15 

9.67 

50.000 

-55.17 

73.51 

91.9! 

-151.57 

12.85 

30  000 

-50.65 

46.45 

68  73 

-118.79 

5.25 

TABLE  13.  STATISTICS  FOR  8 STATIONS 
SATELLITE  SEPARATION  7888.0000  km  - SET 

3 

Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

51.29 

57.44 

77.00 

-12  32 

146.25 

400.000 

42.9' 

56-47 

70.96 

-31.52 

135.84 

300.000 

3.3.63 

58.84 

67.77 

-51.88 

115.21 

200.000 

51.08 

63.97 

81.86 

-32.43 

135.99 

140.000 

63.0' 

69.96 

94.18 

-30.39 

168.00 

100.000 

56.13 

72.17 

91.42 

-43.29 

150.06 

70.000 

50.'0 

63.12 

80.77 

-21.67 

126.30 

50.000 

50.12 

50.68 

71.27 

-2.69 

117.84 

30.000 

55.99 

50.55 

75.44 

-1.54 

125.75 

TABLE  14  - STATISTICS  FOR  7 STATIONS 
SATELLITE  SEPARATION  7888.0000  km  - SET  4 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Max 

500.000 

15.78 

48.91 

51.40 

-34.20 

99.85 

400.000 

14.21 

42.20 

44.53 

-37.26 

85.28 

300000 

46.03 

46.03 

-67.46 

67.63 

200.000 

4.18 

64.08 

64.22 

-89.11 

92.39 

140.000 

8.06 

60.09 

60.63 

-94.93 

75.12 

100.000 

16.53 

49.55 

52.23 

-68.07 

67.30 

44.89 

49.18 

-41.15 

72.56 

5.00 

59.20 

59.41 

-65.16 

75.32 

-39.71 

93.34 

101.44 

-145.74 

94.39 

TABLE  IS.-  WET  ATMOSPHERE  WITH  SIMPLISTIC 


CLIMATIC  CORRECTION 


Sat 

Sep 

7863 

7888 

3 

Set 

No.  Prof  Min.  alt 

Max.  alt. 

No.  Prof 

Min.  alt. 

Max.  alt. 

2 

6 C .25 

7.59 

6 

4.11 

4.95 

3 

8 7.l2 

9.96 

8 

4.99 

7.10 

4 

? 9.70 

11.47 

7 

7.40 

9.28 

t 


i 
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To  determine  the  effect  of  an  improvement  in 
the  water  vapor  correction  technique -that  is,  the  use 
of  more  sophisticated  techniques  than  the  simplistic 
method  described  above-the  same  data  were  run  with 
the  exact  rawinsonde  water  vapor  profile  used  in  both 
the  reference  atmosphere  and  in  the  “measured”  atmo- 
sphere. This  is  equivalent  to  an  ideal  water  vapor 
correction.  Results  of  this  run  (for  the  7863-km 
satellite  separation  only)  are  given  in  tables  16  through 
18,  and  in  figure  8.  The  corresponding  closest  approach 
altitudes  are  shown  in  table  19. 

Finally,  to  determine  the  effects  of  the  temperature 
errors  on  the  system,  the  analysis  of  six  low  altitude  . 
profiles  (without  water)  was  repeated,  this  time  using 
the  reference  temperature  values  (the  values  without  any 
error)  in  both  the  reference  atmosphere  and  in  the 
“measured”  atmosphere.  This  is  equivalent  to  having 
an  ideal  temperature  sensor.  The  result  of  this  run  is 
shown  in  figure  9,  for  the  7863-km  satellite  separation 
only. 


Figure  8.-  Standard  deviation  and  mu  error  venus  pressure. 
Wet  atmosphere  with  ideal  water  vapor  correction  satellite 
separation  7863.0  km. 


Latitude  ranee  25°  - 30°  N 
6 Profiles 

Standard  Deviation  RMS 


Figure  9.-  Standard  deviation  and  rms  error  versus 
pressure.  Ideal  temperature  profile,  dry 
atmosphere. 


ANALYSIS 


In  figures  3, 4, 6,  and  7,  the  solid  line  represents  the  standard  deviation  of  the  error  (in  meters)  while  the  stipled 
line  indicates  the  rms  error;  both  are  shown  as  a function  of  pressure.  The  difference  between  the  standard  deviation 
and  the  rms  values  at  each  pressure  level  is  an  indication  of  the  size  of  the  mean  error  at  this  level.  The  rms  value  is 
equal  to  the  square  root  of  the  sum  of  the  squaies  of  die  standard  deviation  and  the  mean  (rms*  * SD*  + mean* ). 

A cursory  examination  of  these  figures  shows  that  the  curve  of  altitude  error  versus  pressure  follows  a typical 
pattern,  regardless  of  whether  the  atmosphere  is  wet  or  dry,  or,  if  wet,  how  the  water  vapor  error  is  corrected. 
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TABLE  16.  STATISTICS  FOR  6 ST/TIONS 


SATELLITE  SEPARATION  7863.0000  Um  - SET  2 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Ma\ 

500.000 

10.37 

8.66 

13.51 

-0.26 

21.08 

400.000 

5.56 

15.49 

16.45 

-19.55 

28.87 

300.000 

12.47 

22.52 

25.74 

-23.63 

45.27 

200.000 

31.76 

27.86 

42.2S 

1.24 

71.97 

1-<0.000 

29.93 

45.59 

54.54 

-23.13 

80.99 

100.000 

16.42 

27.90 

32.38 

-18.49 

46.27 

70.000 

3.38 

38.68 

28.83 

-54  34 

56.15 

50.000 

8.31 

55.82 

56.43 

-49.15 

92.56 

30.000 

12.83 

24.88 

28.00 

-15.91 

55.76 

TABLE  17.  STATISTICS  FOR  8 STATIONS 


SATELLITE  SEPARATION  7863.0000  kin  SET  2 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Mn\ 

500.000 

13.91 

20.:; 

24,60 

-In. 24 

38.74 

400.000 

5.59 

17.51 

18.38 

-19.58 

27.90 

300.000 

-.3.74 

’8.36 

18.74 

-39.93 

24.18 

200.000 

13.70 

28.19 

31.34 

-20.49 

66.59 

140.000 

25.67 

39.40 

47.02 

-40.89 

92.33 

100.000 

18.75 

37.49 

41.92 

-53.79 

77.89 

70.000 

13.02 

26.43 

29.47 

-32.17 

61.07 

50.000 

12.74 

12.80 

18.06 

2.70 

42.34 

30.000 

18.61 

32.47 

37.43 

-46.31 

67.28 

TABLE  18. - STATISTICS  FOR  7 STATIONS 
SATELLITE  SEPARATION  7863.0000  km  - SET  4 


Level  (mb) 

Mean 

SD 

RMS 

Min 

Ma\ 

500.000 

~ 2o3T 

K05 

5T53 

-i5. 66 

4ff^i 

400.000 

18.81 

25.17 

31.42 

-31.70 

45.03 

300.000 

4.60 

11.40 

12  29 

-9.59 

19.66 

200.000 

8.78 

11.17 

14.20 

-8.24 

16.14 

140.000 

12.65 

20.65 

24.22 

-13.66 

34.70 

100.000 

21.12 

27.21 

34.45 

-28.50 

58.59 

70.000 

24.67 

29.83 

38.71 

-25.05 

71.14 

50.000 

960 

25.67 

27.41 

-20.39 

45.94 

30.000 

-35.12 

44.69 

56.83 

-100.97 

18.59 

TABLE  19.-  WET  ATMOSPHERE  WITH 

IDEAL  CORRECTION 


Sat 

Sep 

Set 

No.  Prof 

Min.  alt.  Max.  alt. 

2 

6 

6.11 

7.65 

3 

8 

7.69 

10.00 

4 

7 

9.98 

11.85 

3-74 


Typically  there  are  two  minima  and  two  maxima.  One  of  the  minima  generally  occurs  in  the  neighborhood  of  300 
to  400  mb,  and  the  second  in  the  neighborhood  of  50  to  70  mb.  The  maxima  are  near  the  30-mb  level  and  between 
100  and  200  mb  (which,  for  most  profiles,  is  the  pressure  level  of  the  tropopause). 


The  pattern  may  be  explained  in  the  following  way:  the  minimum  at  the  lower  altitude,  near  the  300-mb 
level,  is  associated  with  the  closest  approach  altitude  of  the  ray  path  (indicated  by  the  vertical  line  in  each  figure). 
The  system  is  most  sensitive  to  atmospheric  effects 
in  approximately  the  lower  3 km  of  the  ray  path,  so 
the  absolute  value  of  error  in  this  neighborhood  tends 
to  be  minimized. 

The  overall  pattern  is  due  to  the  errors  in  the 
temperature  profile,  which  generate  errors  in  altitude 
when  the  temperature-altitude  profile  is  constructed 
using  the  hypsometric  formula.  This  effect  is  illustrated 
by  the  absence  of  any  extrema  at  all  in  the  results  of 
the  ideal  temperature  profile  case  (fig.  9).  Here  each 
level  has  the  same  error  as  that  of  the  300-  or  400-mb 
level. 

The  magnitude  of  the  error  is,  of  course,  of 
primary  interest.  We  see  in  figures  3 and  4 that  the 
rms  and  standard  deviation  of  the  error  below  300  mb 
are  generally  around  20  to  30  m.  Figure  10  shows  the 
300-mb  error  for  each  of  the  profiles  used  in  the 
generation  of  figure  3,  plotted  as  a function  of  rms 
temperature  error  in  the  100-  to  400-mb  pressure 
range.  While  there  is  no  clear  regression  line,  it  should 
be  noted  that  the  upper  left-hand  portion  of  the  chart 
is  free  of  any  data  points.  This  indicates  that  a low  rms 
value  of  temperature  error  in  the  100-  to  400-mb 
pressure  range  invariably  results  in  a small  altitude  error 
at  the  300-mb  level.  It  is  therefore  reasonable  to  expect 
that  improvements  in  the  performance  of  the  SIRS 

instrument  will  lead  to  concomitant  improvements  in  the  performance  of  the  occultation  system.  (It  should  also 
be  noted  that  a high  rms  value  of  tempeiature  error  does  not  necessarily  lead  to  a large  error  in  the  altitude  of  the 
300-mb  level,  as  witnessed  by  the  large  number  of  points  in  the  lower  right-hand  portion  of  the  chart.) 


Figure  10.-  Altitude  error  it  300-mb  versus  temperature  rms 
error  (pressure  range  10C  400  mb). 


Comparison  of  figures  3 and  4 shows  that  it  is  possible  to  choose  a satellite  separation  for  each  of  the  latitude 
ranges  such  that  the  rms  and  standard  deviation  at  the  300-mb  level  is  less  than  26  m.  We  do  not  suggest  that  there 
must  necessarily  be  more  than  one  satellite  pair  in  the  system,  but  merely  note  that  minimum  ray  altitude  is  an 
important  parameter  *hat  should  be  chosen  in  such  a way  as  to  optimize  the  system  performance.  In  each  of  the 
figures,  the  lower  minimum  occurs  in  the  range  of  the  closest  approach  altitude. 


Turning  to  figure  6,  we  see  that  the  simplistic  water  vapor  correction  gives  remarkably  good  results.  RMS  and 
standard  deviation  at  the  300-mb  level  and  below  are  once  again  in  the  neighborhood  of  20  to  30  m.  That  the  errors 
are  due  the  failure  of  the  corrective  profile  to  adequately  acc*  mt  for  the  water  vapor  is  illustrated  by  figure  8, 
which  gives  the  results  for  the  “ideal”  water  vapor  correction.  1 results  shown  in  figure  8 should  be  compared 
to  the  corresponding  curves  of  figure  3.  It  is  apparent  that  errors  introduced  by  water  vapor  can  be  eliminated  by 
properly  correcting  for  the  water  vapor.  If  exact  water  vapor  corrections  are  not  available,  figure  6 shows  that 
even  the  most  simplistic  of  corrections,  in  which  a standard  water  vapor  profile  is  used  to  correct  for  water  v'lpor 
effects,  can  have  dramatic  results.  The  effect  of  not  correcting  for  water  vapor  at  all  is  to  obtain  errors  on  the  order 
of  hundreds  of  meters. 
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The  effect  of  varying  minimum  ray  height  on  the  system  performance  in  the  presence  of  water  vapor  is 
illustrated  in  figure  7.  We  see  that  the  system  operating  at  the  lower  ray  height  (greater  satellite  separation)  yields 
generally  poorer  results  than  the  system  operating  at  the  greater  ray  height  (smaller  satellite  separation).  Since  water 
' apor  concentration  falls  off  rapidly  with  altitude  (scale  height  on  the  cider  of  2 km),  this  is  not  a particularly 
surprising  result.  Proper  water  vapor  correction  is  more  important  as  the  ray  passes  into  regions  of  high  water  vapor 
content.  Although  it  was  not  possible  to  run  the  experiment  of  figure  8 for  the  lower  ray  height,  presumably  an 
“ideal”  water  vapor  correction  would  have  the  same  dramatic  result  in  this  case  as  it  had  at  the  higher  ray  altitude. 


CONCLUSIONS 


Based  on  the  analysis  or  the  simulated  SIRS  data,  we  can  say  with  confidence  that  the  microwave  occultation 
system  is  capable  of  establishing  a 300-mb  pressure  reference  level  to  within  about  24  m.  The  effects  of  water  vapor 
can  be  corrected  by  suitable  climatological  profiles.  Our  work  has  shown  that  even  the  most  simplistic  climatological 
approach  results  in  a dramatic  improvement  in  accuracy;  more  sophisticated  methods  are  bound  to  be  even  more 
effective. 

Improvements  in  the  accuracy  of  the  SIRS  instrument  will  yield  improvements  in  the  performance  of  the 
occultation  system.  Most  of  the  errors  in  the  output  of  the  system  (neglecting  those  due  to  water  vapor)  are  due 
to  error  in  the  input  temperature  profile.  As  these  input  errors  are  reduced,  system  performance  will  be  significantly 
improved.  Bui  even  with  the  present  SIRS  instrumentation,  the  microwave  occultation  system  gives  results  that 
are  acceptable  to,  snd  badly  needed  by,  the  meteorological  community. 
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DISCUSSION 


Gille:  Does  this  give  the  absolute  altitude  of  the  pressure  reference  above  the  surface? 

Lusignan:  Yes.  It  is  referred  to  as  the  geopotential  of  the  earth. 

Gille:  How  sensitive  is  that  to  orbit  variations? 

Lusignan:  The  system  relies  on  knowing  differential  movement  between  the  satellite,  to  the  order  of  O.S  m. 

It  is  similar  to  the  transponder  used  in  Venus  and  Mars  experiments. 

Reinisch:  You  are  making  phase  measurements;  at  6 -cm  wavelength,  how  great  is  the  uncertainty  of  the 

separation  and  how  do  you  take  care  of  Modulo  2n  phase  change? 

Lusignan:  There  are  two  systems,  parallel  to  those  of  the  Mariner  flights.  There  is  a puase  system,  a coherent 

transponder,  which  gives  an  accuracy  of  3-cm.  This  is  a differential  measurement.  The  second  system  has 
an  FM  modulation  superimposed  that  gives  an  absolute  reference  to  the  order  of  0.S  m. 

Wright:  Is  there  an  ionospheric  version  of  this  experiment  under  consideration? 

Lusignan:  Yes.  Dr.  Grossi  has  proposed  dual  satellite  ionosphere  studies. 

Rodgers:  What  effect  will  horizontal  inhomogeneities  in  water  vapor  and  temperature  have  on  your 

measurements? 

Lusignan:  We’ve  run  tests  with  horizontal  profiles  in  them.  Profiles  across  the  path  have  negligible  effect. 

Along  the  path  we  are  sensitive  to  an  integrated  average  over  about  300  to  500  km.  We  measure  the  weighted 
average  of  the  pressure  in  that  distance. 

Reinisch:  Do  you  expect  a lot  of  phase  scintillation  that  would  distort  your  measurement? 

Lusignan:  In  the  lower  part  of  the  atmosphere  where  there  is  water  vapor,  we  do  get  some  phase  scintillation, 

but  it  does  not  affect  our  group  path  measurement.  We  do  expect  some  clean  multipath  in  the  lower  part 
when  in  the  range  500-700  mb.  Normally  we  operate  at  an  altitude  above  where  this  occurs  (above  300  mb). 


3-77 


OCCULTATION  MEASUREMENTS 


BIBLIOGRAPHY 

Backus,  G.,  and  Gilbert.  J.  F.,  Numerical  Applications  of  a Formalism  for  Geophysical  Inverse 
Problems,  Geophys.  J.  Roy.  Astron.  Soc.,  13.  247-276,  1967. 

Backus,  G.,  and  Gilbert,  J.  F.,  The  Resolving  Power  of  Gross  Earth  Data,  Geophys.  J.  Roy.  Astron 
Soc.,  16.  169-205.  19o8. 

Backus,  G..  and  Gilbert,  J.  F.,  Uniqueness  in  the  Inversion  of  Inaccurate  Gross  Earth  Data.  Phil. 
Trans.  Roy.  Soc.  London,  266,  123-192,  March  5,  1970. 

Baum.  W.  A.,  and  Code,  A.  D.,  A Photometric  Observation  of  the  Occultation  of  Sigma  Arietis  , 
Jupiter.  Astron.  J..  58,  108-112,  1953. 

Beers.  Y..  Introduction  to  the  Theory  of  Error.  Addison-Wesley,  Reading,  Mass.,  1957. 

Bennett.  J.  A..  The  Calculation  of  Doppler  Shifts  Due  to  a Changing  Ionosphere,  J.  Atmos.  Terr. 
Phys..  29.  887-891,  1967. 

Bennett.  J.  A..  The  Ray  Theory  of  Doppler  Fiequency  Shifts,  Aust.  J.  Phys.,  21,  259-272,  1963. 

Bennett.  J.  A..  On  the  Application  of  Variation  Techniques  to  the  Ray  Theory  of  Radio  Propaga- 
tion. Radio  Sci-.  4.  667-678,  Aug.  1969. 

Cain.  D.  L..  Kliore.  A.  J..  and  Levy,  G.  S..  The  Mariner  4 Occultation  Experiment:  Summary  of 
Data  Reduction  Methods.  AIAA  Paper  No.  66-148,  AIAA  3rd  Aerospace  Sciences  Meeting, 
New  York.  New  York,  January  24-26,  1966. 

Chamberlain.  J.W..  and  McElroy,  M.  B.,  The  Martian  Atmosr/nere:  An  Interpretation  of  the 
Mariner  Occultation  Experiment,  Science,  152,  21,  1966. 

Coogan.  J.  M..  A Method  for  Studying  Planetary  Atmospheres  Employing  the  Dual  Flyby  Mode, 
AIAA  5th  Aerospace  Conference,  New  York,  New  Yorx,  Jan.  23-26,  19b7. 

de  Vaucouleurs,  G.,  A Survey  of  Physical  Problems  of  the  Nearer  Planets  and  a Review  of 
Observational  Techniques  Applicable  to  Balloon-Borne  Telescope  Systems,  Ch.  2 in  Strong 
J D..  de  Vaucouleurs  G.,  and  Zwicky  P.,  “Planetary  Astronomy  from  Satellite-Substitute 
Vehicles”,  Missile  Development  Center,  Holloman  Air  Force  Base,  New  Mexico,  Report 
TR-60-6.  March  1960. 

Elsmore,  B.,  Radio  Observations  of  the  Lunar  Ionosphere,  Philo.  Mag.,  2,  1040-1046,  1957 

Eshleman.  V.  R..  Solar  System  Radio  Astronomy,  Proceedings  of  the  Advanced  Study  Institute  on 
Solar  System  Radio  Astronomy,  Sounion,  Greece,  1964,  .!.  Aarons,  ed.,  Plenum  Press,  New 
York.  Ch.  14.  267-293,  1965. 

L'lltleman.  V.  R..  Radar  Astronomy,  Science.  158,  585-597,  Nov.  3,  1967. 


Eshleman,  V.  R„  Fjeldbo,  G.,  Anderson,  J.  D.,  Kliore,  A.,  Dyce,  R.  B.,  Venus:  Lower  Atmosphere 
Not  Measured.  Science,  162,  661-662,  8 Nov.  1968. 


3-78 


Esposito.  R.,  and  Schumer.  M.  A..  Probing  Techniques  for  Transmission  Absorption  Spectroscopy 
of  Planetary  Atmospheres  Using  Satellites.  Remote  Sensing  of  Environment.  1 . S . — sj j 
1969/1970 

Fabry.  C\.  Le  Role  des  Atmospheres  dan  les  Occultations  par  les  Planetes.  J.  Obsersateurs.  12. 
1-10.  1929. 


Fischbach.  F.  F..  Remarks  on  the  Inference  of  Neutral  Atmospheric  Pa. an  eters  from  Microwave 
Occultation  Data.  JPL  Tech.  Report  32-1475.  99-103.  July  I.  1970. 


Fischbach.  F.  F..  Graves.  M.  t..  Hays,  P.  B..  Klein.  G.  S..  Mizgala.  C M..  and  Peterson.  J.  \V.. 
Aiinospheric  Sounding  by  Satellite  Measurements  of  Stellar  Refraction.  Unis,  of  Michigan 
ORA  Tech.  Rep.  04963-2-T.  Dec.  . '>62. 

Fischbach.  F.  F..  Graves.  M.  E..  Hays.  P.  B..  and  Roble.  R.  G.,  Satellite  Measurement  of  Atmo- 
spheric Structure  by  Stellar  Refraction,  Univ.  Michigan  ORA  Tech.  Re;  06647-1-T  April  1965. 

Fischbach,  F.  F..  Graves.  M.  E.,  and  Jones.  L.  M..  Satellite  Meteorology  by  Microwave  Refraction: 
Some  Problems.  COSPAR  Xith  Plenary  Meeting.  Tokyo.  Space  Research  IX,  North  Holland. 
1969. 


Fjeldbo.  G.,  Bistatic-Radar  Methods  for  Studying  Planetary  Ionospheres  and  Surfaces.  Stanford 
Univ.,  Rep.  SU-SEL-64-925,  Stanford  Electronics  Lab.  Stanford  University.  I964.a. 

Fjeldbo.  G.,  Brief  Summary  of  Methods  Used  to  Study  Model  Atmospheres  for  Mars,  Stunfoi  J 
Univ.  memo,  dated  1964  b. 

Fjeldbo,  G.,  Radio  Occultation  Measurements  of  Planetary  Atmospheres  and  Planetary  Surface 
Topography,  AIAA  5th  Aerospace  Sciences  Meeting,  Jan.  23-26,  1967. 


Fjeldbo,  G.,  and  Eshleman,  V.  R.,  Tne  Bistatic  Radar  Occultation  Method  for  the  Study  of 
Planetary  Atmospheres,  J.  Geophys.  Res..  70,  3217-3226,  1965. 


Fjeldbo,  G..  and  Eshleman,  V.  R..  The  Atmosphere  of  Mars  Analysed  by  Integral  Inversion  of  the 
Mariner  IV  Oc  ultation  Data,  Stanford  Univ.  Rep.  No.  SU-SEL-67-109,  Nov.  1967. 

Fjeldbo,  G.,  and  Eshleman,  V.  R.,  The  Atmosphere  of  Mars  Analyzed  by  Integral  Inversion  of  the 
Mariner  IV  Occultation  Data.  Planet.  Space  Sci..  16,  1035-1059,  1968. 

Fjeldbo,  G„  and  Eshleman,  V.  R„  The  Atmosphere  of  Mars:  Radio  Occultation  Measurements  and 
Interpretations,  The  Atmospheres  of  Venus  and  Mars,  J.  C.  Brandt  and  M.  3.  McElroy,  eds., 
Gordon  and  Breach,  N.Y.,  1968. 

Fjeldbo,  G.,  and  Eshleman,  V.  R.,  The  Atmosphere  of  Venus  as  Studied  with  the  Mariner  5 Dual 
Radio  Frequency  Occultation  Experiment,  Stanford  Univ.  Rep.  SU-SEL-69-003,  Jan.  1969. 

Fjeldbo.  G.,  and  Eshleman,  V.  R.,  The  Atmosphere  of  Venus  as  Studied  with  the  Mariner  5 Dual 
Radio  Frequency  Occultation  Experiment,  Radio  Sci..  4,  879-897,  1969. 


3-79 


Fjeldbo.  G..  Eshleman.  V.  R.,  Garriott,  O.  K..  and  Smith,  F.  I...  The  Two-Frequency  Bistatic 
Radar-Occultution  Method  for  the  Study  of  Planetary  Atmospheies.  J.  Geophys.  Res..  70, 
3701-37 10,  1 965. 

Fjeldbo.  G..  Eshleman,  V.  R..  Kliore,  A.  J..  Cam,  D.  L..  Levy,  G.  S..  and  Drake,  F.  D..  Preliminary 
Results  of  the  Mariner  IV  Radio  Oecullaticn  Measurements  of  the  Upper  Atmosphere  of  Mars. 
Proc.  of  Caltech-JPL  Lunar  and  Planetary  Conf..  267-272,  Sept.  1 3-18.  1965. 

Fjeldbo.  G..  Fjeldbo.  W.  C..  and  Eshleman,  V.  R..  Models  for  Atmosphere  of  Mars  Based  or.  the 
Mariner  4 Occultation  Experiment.  J.  Geophys.  Res  , 71, 2307-2316.  May  1.  1966  a. 

Fjelabo.  G..  Fjeldbo.  W C..  and  Eshleman,  V.  R..  Atmosphere  of  Mars:  Mariner  IV  Models 
Compared,  Science,  1 a,  1518  -1523,  1966  b. 


Fjeldbo.  G..  Kliore.  A.,  and  Eshleman.  V.  R..  The  Neutral  Atmosphere  of  Venus  as  Studied  with 
the  Mariner  5 Radio  Occultation  Experiments.  Astronom.  J.,  76,  123-140.  March  1971. 


Fjeldbo.  G..  Kliore.  A.  J..  and  Seidel.  B.,  The  Mariner  1969  Occultation  Measurements  of  the 
Upper  Atmosphere  of  Mars.  Radio  S:L  5,  381-386.  1970. 

Gerver.  M.  L...  and  Markushevich,  V M..  Determination  of  the  Seismic  Wave  Velocity  from  the 
Travel-Time  Curve,  Geophys.  !.  Roy.  Astron.  Soc..  11,  165-173.  1966. 

Gerver.  M.  L.,  and  Markushevich,  V.  M.,  On  the  Characteristic  Properties  of  Travel-Time  Curves, 
Geophys.  J.  Roy.  Astron  Soc.,  1 ’.  241-246,  1967. 

Goldsmith.  D.  W.,  Differential  Refraction  in  Planetary  Atmospheres  with  Linear  Scale  Height 
Gradients.  Icarus.  2,  341-349,  1963. 

Graves.  M.  E..  and  Fischbach,  F.  F„  Analysis  of  Microwave  Occultation  Techniques  for  Atmo- 
spheric Soundings.  Tech.  Rep.  05863- 16-T,  High  Altitude  Engineering  Lab.,  U.  Michigan. 
1969. 

Grossi.  M.  D.,  Lee,  J.  F„  Shear,  I.,  Bravoco,  R.  R.,  Langevin,  P.  E.,  Goff,  R.  W.,  Boe  itz,  M.,  and 
Stifller,  J.  J.  Electromagnetic  Probing  of  the  Mars  and  Venus  Atmospheres  and  Ionospheres 
from  an  Orbiting  Pair.  Raytheon  Rep.  R69-4374-I,  Sudbury,  Mass..  Final  Report  on  Contract 
NASw-i  772.  Sept.  19.  1969. 

Hagfors,  T..  Reflection  and  Diffraction  in  a Bistatic  Radar  Occultation  Experiment,  MIT 
Lincoln  Lab  Rep.,  August  2,  1967. 

Hamel.  G..  Intvgralgleichungen:  Eir.fdhrung  in  Lehre  und  Gebrauch,  Springer  Verlag,  Be  lin,  1937. 


Harrington,  J.  V..  Study  of  a Small  Solar  Probe  (Sunblazer)  Part  I,  Radio  Propagation  Experi- 
ment, mIT  Rep.  PR-5255-5,  July  1,  1965. 

Harrington.  J.  V.,  Goff,  R.  J.,  Grossi,  M.  D.,  Langworthy,  B.  M.,  Electromagnetic  Measurements  of 
Planetary  Atmospheres  and  ionospheres  by  an  Orbital  Pair,  Raytheon  IRP  Rep.  FR-66-7), 
Marcn  7.  1966. 


3-80 


Harrington,  J.  V.,  and  Grossi.  M.  D.,  Global  Probing  of  the  Earth’s  Atmosphere  and  Ionosphere 
by  a Satellite-to-Satellite  Radio  Occultation  Method,  3rd  National  Conf.  on  Aerospace 
Meteorology,  New  Orleans,  La.,  May  6-9,  1968. 


Harrington,  J.  V.,  Grossi,  M.  D.,  Goff,  R.  W„  and  Langworthy.  B.  M..  Radio  Occultation  Measure- 
ments of  Planetary  Atmospheres  and  Ionospheres  from  an  Oi biting  Pair.  AIAA  Paper 
No.  69-53.  AIAA  7th  Aerospace  Sciences  Meeting.  New  York,  New  York.  January  20-22,  1969. 


Harrington.  J.  V.,  Grossi.  M.  D..  and  Langworthy,  B.  M.,  Mars  Manner  IV  Radio  Occultation 
Experiment;  Comments  on  the  Uniqueness  of  the  Results.  L Geophys,  Res..  73,  3039-3041, 
1968. 


Hays,  P.  B..  and  Fischbach,  F.  F..  Analytic  Solution  for  Atmospheric  Density  from  Satellite 
Measurements  of  Stellar  Refraction,  U.  Michigan  ORA  Tech.  Rep  04963-3-T,  Jan.  1963. 

Hays.  P.  B.,  and  Roble,  R.  G.,  Atmospheric  Properties  from  the  Inversion  of  Planetary  Occultation 
Data.  Planet.  Space  Sci..  16.  1 197-1 198,  1968. 

Herglotz,  G„  Uber  das  Benndorfsclie  Problem  der  Fortpflanzungsgeschwindigkeit  der  Erdeben- 
strahlen,  Phys.  7.,  8,  145.  1907. 

Hunten,  D.  M.,  and  McElroy,  M.  B.,  The  Upper  Atmosphere  of  Venus:  The  Regulus  Occultation 
Reconsidered,  J.  Geophys.  Res..  73, 4446-4448,  1968. 

Jones.  L.  M.,  Fischbach,  F.  F.,  and  Peterson  J.  W.,  Atmospheric  Measurements  from  Satellite 
Observations  of  Steilar  Refraction,  U.  Michigan  ORA  Tech.  Rep.  04963-1-T,  Jan.  1962. 

Jones,  L M.,  Fischbach,  F.  F.,  and  Peterson,  J.  W.,  Satellite  Measurements  of  Atmospheric  Struc- 
ture by  Refraction,  Planet.  Space  Sci.,  9,  351-252,  June  !%2. 


Kliore,  A.  J.,  and  Cain,  D.  L.,  Mariner  5 and  the  Radius  of  Venus,  J.  Atmos.  Sci,,  25.  549-554. 
1968 


Klioie,  A.  J.,  Cain,  D.  L„  Drake,  F.  D.,  Eshieman,  V.  R.,  Fjeldbo,  G.,  and  Levy,  G.  S.,  A Summary 
of  Preliminary  Results  of  Mariner  IV  Radio  Occultation  Experiment,  Proc.  Ionospheric 
Research  Committee  „f  the  Avionicc  Panel,  75-81,  AGARD/NATO,  Rome,  Italy,  Sept.  21-25, 
1965. 


Kliore,  A.  J.,  Cain,  D.  L.,  and  Hamilton,  T.  W.,  Determination  of  Some  Physical  Properties  of  the 
Atmosphere  of  Mars  from  Changes  in  the  Doppler  Signal  of  the  Spacecraft  on  an  Earth- 
Occultation  Trajectory.  JPL  Tech.  Rep.  32-674,  Oct.  15,  1964. 


Kliore,  A.  J.,  Cain,  D.  L.,  and  Levy,  G.  S.,  Radio  Occultation  Measurements  of  the  Martian  Atmo- 
sphere Over  Two  Regions  by  the  Mariner  IV  Space  Probe,  Moon  and  Planets,  226-239,  North- 
Holland  Publishing  Co.,  1966. 


3-81 


Kliore.  A.  J.,  Cain.  D.  L.,  and  Levy,  G.  S.,  Radio  Occupation  Measurement  of  the  Martian  Atmo- 
sphere Over  Two  Regions  by  the  Mariner  4 Space  Probe,  Space  Research,  Vll,  Moon  and 
Planets.  North  Holland  Publishing  Company,  Amsterdam,  1968. 


Kliore.  A.  J..  Cain,  D.  L..  Levy,  G.  S.,  Eshleman,  V.  R.,  Drake,  F.  D.,  and  Fjeldbo.  G.,  The 
Mariner  4 Occupation  Experiment,  Astron.  Aeron.,  7 72-80.  1965. 


Kliore,  A.  J..  Cain.  D.  L.,  Levy,  G.  S.,  Eshleman,  V.  R.,  Fjeldbo,  G.,  and  Drake.  F.  D.,  Occultation 
Experiment:  Results  of  the  First  Direct  Measurement  of  Mars  Atmosphere  and  Ionosphere, 
Science.  149.  1 243-P48.  Sept.  10.  1965. 


Kliore.  A.  J..  Cain.  D.  L.,  Levy,  G.  S.,  Eshleman,  V.  R„  Fjeldbo,  G..  and  Drake,  F.  D..  Preliminary 
Results  of  the  Mariner  IV  Occultation  Measurement  of  the  Atmosphere  of  Mars.  Proc.  Caltech- 
JPL  Lunar  and  Planetary  Conf.,  P-  .adena,  Cal.,  257-266,  Sept.  13-18.  1965. 


Kliore.  A.  J.,  Cain,  D.  . Levy,  G.  S.,  Fjeldbo,  G.,  and  Rasool,  S.  L,  Structure  of  the  Atmosphere 
of  Venus  Derived  from  Mariner  5 S-Band  Occultation  Measurements,  Space  Research,  IX,  Moon 
and  Planets.  North  Holland  P’  blishing  Company,  Amsterdam,  1969. 


Kliore.  A.  J.,  Fjeldbo,  G.,  and  Seidel.  B.,  First  Results  of  the  Mariner  IV  Radio  Occultation 
Measurement  of  the  Lower  Atmosphere  of  Mars,  Radio  Sci..  5,  373-379,  Feb.  i 3 / O.a. 


Kliore.  A.  J..  Fjeldbo,  G.,  and  Seidel,  E.  L.,  Summary  of  Mariner  6 and  7 Radio  Occultation 
Results  on  the  Atmosphere  of  Mars.  Paper  No.  M-25,  13th  COSPAR  Meeting,  Leningrad, 
U.S.S.R.,  1970.b. 


Kliore.  A.  J.,  Fjeldbo,  G.,  Seidel,  B.  L.,  and  Rasool,  S.  1.,  Mariner  6 and?:  Radio  Occultation 
Measurements  of  the  Atmosphere  of  Mars,  Science,  166,  1393-1397,  1969. 


Kliore,  A.  J.,  Levy,  G.  S.,  Cain,  D.  L.,  Fjeldbo,  G.,  and  Rasool,  S 1.,  Atmosphere  and  Ionosphere 
of  Venus  From  the  Mariner  V S-Band  Radio  OcouitaGc'  Measurement,  Science,  158, 
1683-1688,  Dec.  1967. 


Kliore.  A.  J..  Tito,  D.  A.,  Cain,  D.  L.,  and  Levy,G.  S.,  A Radio  Occultation  Experiment  to  Probe 
the  Atmosphere  of  Venus,  A1AA  5th  Aerospace  Sciences  Meeting,  Jan.  23-26,  1967.  J.  Space- 
craft Rockets.  4,  1339-1346,  1967. 


Kliore.  A.  J.,  and  Tito,  D.  A.,  Radio  Occultation  Investigations  of  the  Atmosphere  of  Mars,  J. 
Spacecraft  4.  578-582,  May  1967. 


3-82 


Laiiv-.m.  M..  Russo,  L' , and  Tagliaferri,  G.  L..  Atmospheric  Density  in  the  120-190  km  Region 
Derived  From  the  X-Rcy  Extinction  Measured  by  the  US  NRL  Satellite  1 964-01 -D.  Nature.  206, 
173-174.1965. 

Landini.  M„  Russo,  D.,  and  Tagliaferri,  G.  L„  Atmospheric  Density  Measured  by  the  Attenuation 
of  the  Solar  X-Rays  Monitored  on  the  NRL  i965-16-D  Satellite,  Icarus,  6.  236-241,  1967. 

Langworthy,  B.  M.,  HRT-11  Model  Matching  Program.  Raytheon  Tech.  Memo  MS  J/AERO-3674. 
May  4.  1967. 

Link.  F.,  Eclipse  Phenomena.  Advances  in  Astronomy  and  Astrophysics  2,  Zdenek  Kopal.  ed.. 
Academic  Press.  87-198.  1963. 

Link,  M.  F..  Theorie  Photometrique  des  Eclipses  de  Lune,  Bull.  Astron.,  6 77-108,  1932. 

Link,  M F„  Occultations  et  Eclipses  par  les  Planetes,  Bull.  Astron.  10,  227  -249.  1934. 

Levy,  G.  S.,  Ostoshi,  T.  Y.,  and  Seidel,  B.  L.,  Ground  Instrumentation  for  Ma’.iner  IV  Occultation 
Experiment,  IEEE  Trans.  Instrumentation  and  Measurements.  1M-16,  100-1 14,  1967. 

Lombardini,  P.  P„  Doviak,  R.  J.,  and  Coldhirsh,  J.,  Centimetric  and  Millimetric  Wave  Spectroscopy 
of  Planetary  Atmospheres  from  Orbital  Pairs,  Raytheon  1RP  Rep.  FR-66-242,  Rev.  A,  March  1, 
1967. 

Lusignan,  B.,  A Preliminary  Study  of  Atmospheric  Density  Measurements  by  Means  of  Satel- 
lites, Final  Report  on  Contract  NAS  9-"  020,  Stanford  Electronics  Lab..  March  1968. 


Lusignan  B.,  Density  Measurement  with  Radio  Wave  Occultation  Techniques,  Space  Research  IX, 
603-609,  North-Holland  Publishing  Co.  Amsterdam,  1S69. 


Lusigr.an,  B.,  Inverting  Radio  Occultation  Data  Using  Empirical  Orthogonal  Function,  JPL  Tech. 
Rep.  32-1475,  1 15-1 17,  July  1,  1970. 


Lusigna  B.,  Modrell,  G.,  Morrison,  A.,  Pomalaza,  J.,  and  Ungar,  S.  G„  Sensing  the  Earth’s  Atmo- 
sphere with  Occultation  Satellites,  Proc.  IEEE,  57, 458-467,  1969. 


Mariner  Stanford  Group,  Venus:  Ionosphere  and  Atmosphere  as  Measured  by  Dual-Frequency 
Radio  Occultation  of  Mariner  V,  Science,  153,  1678-1683,  December  29,  1967. 

Martin,  F.  L.,  and  Wright,  F.  E.,  Radar  Ray  Refraction  Associated  with  Horizontal  Variations  in 
the  Refractivity,  J.  Geophys.  Res.,  68,  1861-1869,  1963. 

Meeus,  J.,  Occultations  of  Bright  Stars  by  Planets,  J.  Brit.  Astron.  Assoc.,  "O,  174-183,  I960. 

Menzel,  D.  H.,  and  de  Vancouleurs,  G.,  Results  From  the  Occultation  of  Regulus  by  Venus, 
Astron.  J.,  65,  351,  Aug.  1960. 


3-83 


Menzel,  D.  H.,  Optical  Refraction  in  a Planetary  Atmosphere  with  Application  to  the  Apparent 
Diameter  of  a Planet  and  to  Occultation,  GCA  Tech.  Rep.  61-33-A,  Contract  AF33(616)-7413. 
Geophysics  Corp.  America,  July  1961. 


Menzel.  D.  H..  and  de  Vancouleurs.  G.,  Final  Report  on  the  Occultation  of  Regulus  by  Venus, 
ARDC  Contract  AF19( 604 V746 ! , July  7,  1959. 


Mikhailov,  A.  A..  Teona  Zatmeniy  (Theory  of  Eclipses;,  l\  oscow  1954. 

Miller.  J D.  and  Goldman,  H.  J..  Feasibility  of  Multi-Satellite  Occultation  (Refraction)  Measure- 
niv'i  ’*■  for  Meteorology,  1 IT  Research  Institute,  Tech.  Memo  28.  Final  Rep"r*  on  Task  i of 
r ontr.ict  N.\Sr-65(25),  Aug.  1968. 

Pannekoek.  A.,  Uber  die  Erscheinungen,  welche  bei  einer  Sternbedeckung  durch  einen  plarcten 
auftreten,  Astron.  Nachrichten,  164,  5-10,  1903. 

Pettit.  E.,  and  Richardson,  R.  S.,  Motion  Pictures  of  the  Occultation  of  Sigma  Arietis  by  Jupiter 
on  November  20,  '9 52,  Publ.  Astron.  Soc.  Pac.,  65,  91-92,  April  1953. 

Phinney,  R.  A.,  and  Anderson,  D.  L„  On  the  Radio  Occultation  Method  for  Studying  Planetary 
Atmospheres,  J.  Geophys.  Res.,  73,  1819-1927,  1968. 

Pirraglia.  J.,  and  Gross.  S.  H.,  Latitudinal  and  Longitudinal  Variation  of  a Planetary  Atmosphere 
and  the  Occultation  Experiment,  Planet.  Space  Sci.,  18,  1769-1784,  1970. 

Pomalaza,  J.  C„  Remote  Sensing  of  the  Atmosphere  by  Occultation  Satellites,  Stanford  Univ. 
Rep.  SEL-69-045,  Sept.  1969. 

Rasool,  S.  1„  and  Stewart,  R.  W.,  Results  and  Interpretation  of  the  S-Band  Occultation  Experi- 
ments on  Mars  and  Venus,  J.  Atmos.  Sci.,  28,  869-878,  Sept.  1971. 

Raytheon  Company,  Electromagnetic  Probing  of  the  Martian  Ionosphere  and  Atmosphere  from 
an  Orbiting  Pair,  Final  Report  on  Contract  NAS-8-21222,  R67-4537,  Dec.  19,  1967. 

Raytheon  Company,  Mariner  Mars  ’71  Missions:  Orbiter-to-Orbiter  Radio  Occultation  Experi- 
ment, Technical  Proposal  R68-4285-1,  June  21,  1968. 

Raytheon  Company,  Electromagnetic  Probing  of  the  Mars  and  Venus  Atmospheres  and  Iono- 
spheres from  an  Orbiting  Pair,  Final  Report  on  Contract  NASw-1772,  R69-4347-1, 2,  Sept.  19, 
1969. 

Shawham,  S.  D.,  Ionization  Rate  and  Electron  Density  Profile  foi  the  Martian  ionosphere  Based 
on  Mariner  IV  Observations,  47th  Annual  AGU  Meeting,  Washington,  D.C.,  April  19-22,  1966. 

Sheer,  1.,  Bravoco,  R.  R.,  Grossi,  M.  and  Langevin,  P.  E.,  Profile  Inversion  Processing  of  Radio 
Occultation  Data  for  the  Determination  of  Planetary  Atmospheres,  Conf.  on  Scientific  Appli- 

Shklovskii,  I.  S.,  Physics  of  Solar  Corona,  Pergamon  Press,  Oxford,  1965. 


Sodek,  B.  A.,  Jr.,  Jovian  Occultation  Experiment,  J.  Spacecraft  Rockets,  5,  461-463,  1968. 


3-84 


Stanford  University,  SPINMAP,  Stanford  Proposal  for  an  International  Network  for  Meteoro- 
logical Analyses  and  Prediction,  Final  Report,  June  1966. 

Stanford  Electronics  Lab.  Proposal  to  NASA  for  a Microwave  Occultation  Experiment  on  the 
Nimbus  E Meteorological  Satellite*  RL-5-68,  Stanford  Univ.,  Feb.  29,  1968. 

Tatarskiy,  V.  I..  Determining  Atmospheric  Density  from  Satellite  Phase  and  Refraction  Angle 
Measurements,  Izv.  Bull.  Acad.  Sci.  USSR,  Atmos.  Ocean  c Phys.,  4.  699-709,  1968  a. 

Tatarskiy,  V.  I.,  The  Accuracy  Achievable  in  Determining  Atmospheric  Density  from  Satelbce 
Phase  and  Refraction  Angle  Measurements,  Izv.  Bull.  Acad.  Sci.  USSR  Atmos.  Oceanic  Phys..  4. 
811-818.  1968  b. 

Vitkevich,  V.  V..  New  Method  of  Investigating  the  Solar  Corona,  Dokl.  Akad.  Nauk..  USSR.  77, 
585-588,1951.  — — ~ 

WaPio,  H.  A.,  and  Williams,  J.  R.,  A Feasibility  Study  of  Determining  Physical  Properties  of  the 
Martian  Atmosphere  by  use  of  Solar  Occultation  as  Seen  from  an  Artificial  Satellite,  NASA  TN 
D-5443,  1 969. 

Wiechert,  E.,  and  Geiger,  L.,  Bestimmung  des  Weges  der  Erdbebenwellen  im  Erdinnern,  Phys.  Z., 
11.294,1910. 

Weisberg,  H.  L.,  The  Study  of  Planetary  Atmospheres  by  Stellar  Occultation,  RM-3279-JPL 
(Contract  N-33561  NAS  7-100),  Rand  Corp.,  Oct.  ' 962. 

Weisberg,  H.  L.,  On  the  Possibility  of  Studying  Planetary  Atmospheres  by  Observing  Stellar 
Occupations  from  a Flyby  Space  Probe  or  Planetary  Orbiter,  Icarus,  2,  226-227.  Oct.  1963. 

Werbowetzki,  A.,  Redaction  Data  with  Multiple  Satellites,  Bull.  Amer.  Mettorol.  Soc.,  47,  199, 
March  1966. 


4.  IONOSPHERIC  SOUNDING 


As  in  Chapters  2 and  3,  we  are  concerned  here  with  active  radar  sounding,  both  monostatic  and  bistatic,  but  at 
wavelengths  such  that  the  probing  signals  are  negligibly  affected  by  the  atmosphere,  but  are  reflected  by  the  iono- 
sphere. The  subject  is  a half-century  old,  and  quite  sophisticated  procedures  have  been  developed  to  invert  the  radar 
data  to  yield  profiles  of  electron  density.  The  experiment  has  been  performed  only  on  the  earth’s  ionosphere, 
although  it  is  applicable  to  any  planetary  ionosphere.  Curiously,  quite  different  techniques  'nd  algorithms  are  in  use 
for  “bottomside”  sounding  and  for  “topside”  sounding.  They  are  described  in  the  following  six  papers.  A com- 
panion paper  on  VLF  sounding  of  the  D region  may  be  found  in  Chapter  8.  The  papers  all  deal  with  pulse,  or  amp- 
litude, sounding  of  the  ionosphere;  i.e.,  only  the  time  delay  characteristics  of  the  radar  echoes  are  used.  More 
sophisticated  techniques  using,  for  example,  phase,  direction  of  arrival,  or  polarization  have  appeared  in  the  litera- 
ture but  have  not  found  extensive  application.  Certain  of  the  papers  in  Chapter  6 on  electromagnetic  scattering  in 
fact  relate  to  this  latter  subject. 

J.  E.  Jackson  organized  and  chaired  the  session  devoted  tc  ionospheric  sounding. 
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THE  P'(f)  TO  N(h)  INVERSION  PROBLEM  IN  IONOSPHERIC  SOUNDINGS 


John  E.  Jackson 

NASA  Goddard  Space  Flight  Center 
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ABSTRACT  ► K7 3 ~ I I $ © 2 


A general  review  is  given  of  the  inversion  techniques  used  to  derive  the  ionospheric  electron  density  N as  a 
function  of  altitude  h from  group  path  P'  versus  frequency  / measurements  obtained  by  vertical  incidence  iono- 
spheric sounders.  The  paper  discusses  the  medium  under  investigation,  the  experimental  techniques  used  to  obtain 
the  P‘(f)  data,  the  theoietical  considerations  leading  to  the  integral  equation  relating  P'(f)  to  N(h),  and  the 
assumptions  made  in  the  inversion  process.  The  lamination  inversion  technique  is  then  presented,  with  special  atten- 
tion given  to  mathematical  difficulties  arising  from  discontinuities  in  the  P'(f)  function,  infinities  in  the  integrand, 
and  in  some  cases  unknown  integration  limits.  Methods  outlined  for  minimizing  the  uncertainties  due  to  discontin- 
uities include  the  use  of  redundant  information  - that  is  the  two  distinct  P'{f)  functions  available  for  a given  N{h) 
profile  - and  the  use  of  models  based  on  statistical  data.  Mathematical  procedures  are  discussed  that  increase  signifi- 
cantly the  efficiency  and  accuracy  of  the  required  numerical  integrations.  The  accuracy  of  the  inversion  technique 
is  deduced  by  comparing  the  resulting  N(h)  profile  with  N{h)  data  obtained  by  simultaneous  but  independent 
observations. 


INTRODUCTION 


Ionospheric  data  obtained  by  the  vertical  incidence  pulse  sounding  technique  have  been  used  extensively  to 
derive  profiles  of  electron  density  N versus  altitude  h.  The  calculation  of  N(h)  profiles  from  the  P\f)  sounder- 
data  requires  the  inversion  of  an  integral  equation  of  the  form: 


where 


p = apparent  range 
/ = frequency 
p - propagation  path 
p0  * location  of  sounder 
pr  = location  of  reflection  point  for  frequency  f 
ri  * group  refractive  index 
N - electron  density 

B « amplitude  of  terrestrial  magnetic  Held  vector  B 
♦ = angle  between  B and  direction  of  propagation 


" 'n*fw 


The  fundamental  formula  for  n'  is: 


(2) 


where  n , the  real  part  of  the  refractive  index  of  the  medium,  is  given  by 

, 2*(1-*) 

« * i : — 

2(1  -*)-  Y\  ± y Y*r+  4(1  -xy  Y\ 


(3) 


where 

x = m l 
/• 

/7V  = plasma  frequency  (MHz)  = 8.98X  10  3 V77”(electrons/cm3 ) 

Y m fH 

~T 

fH  ~ gyrofrequency  (MHz)  = 2.8f? (gauss) 

Yt  = Y sin  4> 

Yl  = * cos  4» 

t = positive  sign  in  front  of  square  root  is  for  the  ordinary  wave;  negative  sign  is  for  the  extraordinary 
wave. 


(4) 

(5) 

(6) 
(7) 


The  ordinary  wave  reflects  when  * = 1 (except  when  4>  = 0)  or  equivalently  [as  seen  from  Eq.  (4)]  when: 

/-/N  (8) 

The  extraordinary  wave  reflects  when  * = 1 - * (if  K < 1)  and  also  when  * = 1 + T.  In  P\f)  terminology, 
however,  only  the  echo  corresponding  to  * * 1 - Y is  called  an  extraordinary  echo,  and  the  echo  corresponding  to 
*=  1 + Y is  called  the  Z echo.  Using  Eqs.  (4)  and  (6)  to  express  the  reflection  conditions  in  terms  of  fN  and  fH 
yields  the  frequency  fx  of  the  extraordinary  echo: 

fx=~(fH+  sr*W+ftf  ) (9) 


and  the  frequency  fz  of  the  Z echo: 


fz=fic~fH 


00) 


Equation  (1)  was  written  in  terms  of  a generalized  propagation  path  p to  indicate  that  the  sounder  data  do  not 
necessarily  correspond  to  “vertical”  incidence.  The  propagation  path  is,  however,  essentially  a vertical  path,  if  the 
electron  density  distribution  is  a function  of  altitude  only  (spherically  stratified  ionosphere).  Most  P'(f)  to  N(h) 
inversion  techniques  assume  that  stratifies tio”  :s  essentially  spherical  within  the  ionospheric  region  from  which  the 
sounding  echoes  are  received  and  that  the  propagation  path  ii  vertical.  Except  when  otherwise  indicated,  these 
assumptions  are  also  made  here  - that  is,  in  Eq.  (1)  p is  replaced  by  h and  4>  is  replaced  by  (90°  - 0)  where  8 is 
magnetic  dip  angle. 
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The  complex  ity  of  the  n function  [Jackson,  1969a]  makes  it  impossible  to  evaluate  analytically  the  integral 
in  Eq.  (1),  except  in  very  special  cases  (4>  * 0 and  4>  * 90°).  Numerical  integration  techniques  have  to  be  used 
[even  for  a very  simple  N(h)  model] , and  special  care  must  be  taken  near  the  reflection  points  where  n becomes 
infinite.  It  should  also  be  noted  that  N(h)  is  the  only  unknown,  since  B and  <t>  are  specified  by  the  sounder 
location  and  can  be  calculated  to  a high  order  of  accuracy  from  a terrestrial  field  model  [Gain  and  Cain,  1968] . 

The  difficulties  encountered  in  the  inversion  of  Eq.  (1)  are  due  primarily  to  gaps  and  discontinuities  in  the  P\f) 
data.  The  redundancy  of  the  data  [availability  ot  more  man  one  P\f)  function)  helps  overcome  these  problems. 

But  in  many  cases  it  is  also  necessary  to  have  some  knowledge  of  the  gross  features  of  the  N(h)  distribution.  A com- 
plete discussion  of  the  inversion  process  must  therefoie 
1 000  aT  “ ~A\  1 include  some  basic  Information  concerning  the  medium 

\ \ under  investigation  end  the  limitations  of  the  P'(f) 

900  - \ \ - measuring  technique.  The  characteristics  of  the  iono- 

\ \ sphere  and  of  the  source,  technique  are  such  that  two 

800  ~ \ \ - sounders  (one  on  the  ground  and  the  other  in  an  earth- 

\ DAYTIME  orbiting  satellite)  are  required  to  permit  a complete 

700  - \ \ - | determination  of  the  ,V(h)  distribution  at  a given  loca- 

\ tion.  Here  we  discuss  the  P'(f)  to  N(h)  inversion 

s 600  - NIGHTTIME  \ \ - problems  associated  with  both  types  of  soundings. 
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Figure  1.  - Vertical  electron  density  distributions  based  upon 
simultaneous  topside  and  ground-based  observations  at 
Wallops  Island,  Virginia  (latitude  37.84s  N,  longitude 
75.47°  W,  fH  - 1.45  MHz,  9 - 69°).  The  daytime  curve 
is  for  June  18, 1968  at  1 200  EST  and  the  nighttime  curve 
is  for  June  4, 1968  at  0345  EST.  The  two  N(h)  profiles 
corresponded  to  the  same  magnetic  index  iKp  ■ 3)  and 
10.7  cm  solar  flux  (S,«  144).  The  altitude  difference 
between  S and  T was  made  about  15  km  greater  than 
observed  in  order  to  show  the  E valley  more  dearly. 


THE  MEDIUM  UNDER  INVESTIGATION 


Since  die  ionosphere  is  produced  primarily  by 
ionizing  radiation  from  the  sun  (offset  by  subsequent 
recombination  and/or  transport  of  the  electron-ion  gas 
thus  created),  the  N(h)  distribution  depends  on  the 
level  of  solar  activity  and  undergoes  diurnal,  seasonal, 
and  latitudinal  variations.  The  terrestrial  magnetic  field 
influences  not  only  the  n function,  but  also  die  N{h) 
distribution.  Both  effects  are  due  to  the  Held  control 
(through  the  V X B mechanism)  on  the  motion  (trans- 
port or  radiowave-induced  oscillations)  of  the  free  elec- 
trons. Since  the  mafnetk  axis  it  not  aligned  with  the 
rotation  axis  of  the  earth,  the  magnetic  control  produces 
longitudinal  variation*  in  die  N(h)  distribution  (in  addi- 
tion to  variations  due  to  local  time  differences). 


corresponded  to  the  same  magnetic  index  v*p  - 3)  and  Although  the  vaiiability  of  the  medium  can  intro- 

10.7  cm  tolar  flux  (S,«  144).  The  altitude  difference  jMn.  ,nm.  i n-  . ..  , 

between  Send  Turned,  .bout  15  km  peeler  dun  dlf*0me  (or  the  cal- 

obterved  in  order  to  ehow  the  E vtUey  more  deeriy.  eolations,  certain  features  of  the  daytime  and  nighttime 

N(h)  distributions  are  sufflciently  regular  - at  least  at 
midlatitudes  - to  provide  a basa  for  the  conventional  division  of  the  ionosphere  into  D,  E,  FI,  and  F2  regions. 
The  boundaries  between  these  regions  have  never  been  precisely  defined,  but  it  is  usually  understood  that  the  1) 
region  is  below  90  km,  the  E region  between  90  and  140  km,  the  FI  region  between  140  and  180  km,  and  the  F2 
region  above  180  km.  The  typical  features  of  the  midlatitude  N(h)  distribution  are  shown  in  figure  1 , a daytime 
and  a nighttime  profile  measured  over  Wallops  Island,  Virginia.  An  important  characteristic  of  the  distributions  is 
the  fact  that  the  electron  density  becomes  significant  at  an  altitude  of  about  80  km  in  the  daytime  and  about  1 53  km 
at  night  (points  Q on  flg.  1)  and  increases  almost  monotonically  up  to  the  height  of  maximum  density  (hmar.F2, 
shown  on  fig.  1 as  point  M),  which  occurs  typically  at  altitudes  between  200  and  400  km.  The  major  exceptions 
to  this  rule  occurs  in  daytime  in  the  region  between  1 10  and  140  km  (£  valley).  Above  hmixF2  (topside  iono- 
sphere) the  electron  density  decreases  monotonically  and  in  very  nearly  exponential  fashion. 
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EXPERIMENTAL  DETERMINATION  OF  THE  P\f)  FUNCTION 


The  ionospheric  sounder,  or  ionosonde,  opera'es  on  principles  similar  to  those  of  radar,  and  it  provides  echoes 
from  the  ionosphere  over  a wide  range  of  operating  frequencies.  Ionospheric  soundings  are  typically  conducted  at 
frequencies  between  0.1  and  20  MHz,  using  100  Msec  pulses  and  a 20-to-60  pulse/sec  repetition  rate.  A complete 
frequency  sweep  takes  typically  15  to  30  sec. 

In  the  widely  used  swepi-frequency  system  the  received  echoes  are  customarily  displayed  in  the  “ionogram” 
format,  in  which  the  echo  round-trip  time  At  is  displayed  (in  the  vertical  axis  direction)  as  a function  of  the  sounder 
frequency  /.  The  quantity  (Ar)c/2  (where  c = velocity  of  light  in  vacuo)  is  the  apparent  range  P'\  it  is  greater  than 
the  distance  to  the  echoing  region  because  the  sounding  signal  is  retarded  by  the  ionosphere.  The  apparent  range  P' 
at  a frequency  fr  is  therefore: 

P'(/,)=y(A'r)  Ul) 

where 


Vq  * VqU.N.B.Q)  - the  velocity  of  the  sounding  signal 

pr  * a reflection  point  for  the  frequency  fr  and  the  integration  is  along  the  ray  path. 

Combining  Eqs.  (11)  and  (12)  gives 

rPr  c 

P'(fr)m  I (l3) 

Jpo  G 

The  quantity  c/Vq  is  the  group  refractive  index  and  Eq.  (13)  becomes 


P 


n dp 


as  given  in  Eq.  (1).  The  vertical  axis  of  the  ionogram  is  calibrated  in  terms  of  P'  and  the  basic  measurement  provided 
by  the  ionogram  is  P'  as  e function  of  frequency. 


CHARACTERISTICS  OF  TYPICAL  P\f)  CURVES 


The  three  reflection  conditions  [Eqs.  (8)  through  (10)]  give  rise  to  the  ordinary,  extraordinary,  and  Z tiaces 
on  ionograms.  Conventional  sounding4  from  the  ground  rarely  yield  usable  Z traces,  but  they  normally  provide  both 
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ordinary  and  extraordinary  traces.  Each  P\f)  trace  can  be  used  (with  appropriate  inversion  techniques)  to  derive 
N(h)  profiles.  The  ordinary  trace  is  generally  the  most  useful  one  on  ground-based  soundings  and  the  extraordinary 
trace  is  the  most  useful  one  on  topside  soundings  ( Jackson , 1969a,  p.  963] . In  both  cases  (topside  and  ground- 
based  soundings),  the  presence  of  at  least  one  trace  in  addition  to  the  one  used  for  primary  data  analysis  provides 
redundant  information  that  van  be  used  to  check  assumptions  made  in  the  inversion  of  Eq.  (1)  and  to  minimize 
uncertainties  due  to  gaps  in  the  primary  data.  Further  redundancy  of  information  is  also  available  on  topside  iono- 
grams,  as  the  result  of  resonance  phenomena  that  occur  at  /TV,  ///,  and  at  the  upper  hybrid  frequency  fT,  given  by 

/T  = y/ffFTJii*  (15) 


These  resonances  permit  the  calculation  of  N and  B at  the  satellite,  which  is  particularly  helpful  when  the  propa- 
gation traces  are  not  clearly  defined  at  the  satellite.  Since  N and  8 (consequently,  /TV  and  fH)  both  decrease 
monotonically  with  altitude  in  the  topside  ionosphere,  the  P'(J)  function  is  known  in  principle  for  both  ordinary 
and  extraordinary  traces  for  the  complete  topside  profile  below  the  satellite  (portion  AM  of  profiles  in  figure  1).  On 
ground-based  soundings,  however,  the  P'(f)  function  is  not  defined  for  the  portion  QR  of  the  profiles  on  figure  1 
(where  R corresponds  to  the  minimum  density  for  which  echoes  are  obtained  by  the  sounder),  arid  for  the  portion 
ST  of  the  daytime  profile. 

The  ordinary  ray  P'(f)  functions  corresponding  to  N(h)  profiles  of  figure  1 are  shown  on  figures  2 and  3 
for  topside  and  for  ground-based  soundings.  To  permit  a direct  comnarison  between  apparent  heights  P'  and  the 
actual  reflection  heights,  the  N(h)  profiles  have  been  redrawn  with  the  density  expressed  in  terms  of  plasma  fre- 
quency [using  Eq.  (5)] . 
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Figure  2.-  Vertical  distribution  of  plasma  frequency  (dashed 
curve)  corresponding  to  the  daytime  A ’(h)  profile  of 
figure  1,  and  corresponding  ordinary  ray  P'(f)  functions 
(solid  curves)  for  topside  and  ground-based  soundings. 
The  virtual  range  scale  is  one-half  the  altitude  scale  to 
keep  the  iono grams  from  overlapping. 


Figure  3.-  Vertical  distribution  of  plasma  frequency  (dashed 
curve)  corresponding  to  (he  nighttime  N{h ) profile  of 
figure  1,  and  corresponding  ordinary  ray  P'(f)  functions 
(solid  curvet)  for  topside  and  ground-based  soundings. 
The  virtual  range  sole  is  one  half  tt*  altitude  scale  to 
keep  the  ionogrami  from  overiaroing. 
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VIRTUAL  RANGE  (KM)  VIRTUAL  RANGE  (KM) 


It  is  seen  that  topside  and  ground-based  soundings  terminate  at  hmaxF2,  which  follows  from  the  monotonic 
variation  of  fN  and  fx  with  electron  density.  Actually,  the  soundings  do  not  quite  reach  ^max^:  the  topside 
soundings  stop  typically  at  a point  M"  (IS  to  30  km  above  point  M of  figure  1)  and  ground-based  soundings  stop 
typically  at  a point  M"  (10  to  20  km  below  Af).  The  region  \fM"  can  be  derived  from  simultaneous  topside  and 
ground-based  soundings  [Jackson,  1969b]  or  by  extrapolation  of  either  type  of  soundings.  This  extrapolation 
usually  assumes  that  the  N(h)  distribution  can  be  represented  by  w ^uapman  function  in  the  vicinity  of  hmaxF 2; 
i.e.,  N(h)  is  assisted  to  be  given  by. 


N ’ A'max  exp  y ( 1 - z - e-2) 


(16) 


\ here 


z 3 {h-hm9X)l H 
H = scale  height 


There  is  one  additional  source  of  information  that  can  be  used  to  check  L <*  total  profile  obtained  by  simultan- 
eous topside  and  ground-based  soundings.  Topside  ionograms  frequently  display  echoes  reflected  from  the  ground 
(ground  trace  or  ground  echoes).  These  echoes,  which  are  for  frequencies  greater  than  the  maximum  frequency  re- 
flected from  the  ionosphere,  exhibit  a delay  that  is  determined  by  the  total  profile  and  by  the  frequency  of  the 
sounder  signal.  It  is  therefore  possible  to  check  the  accuracy  of  the  total  profile  by  comparing  the  observed  ground 
echoes  with  the  ground  echoes  calculated  from  the  total  N(h)  distribution  [Jackson,  1969b] . 


MATHEMATICAL  TECHNIQUES  USED  IN  THE  INVERSION  PROCESS 


Here  we  discuss  the  technique  used  for  the  inversion  of  Eq.  (1)  for  the  case  of  vertical  propagation  - that  is. 


Equation  (17)  is  wntten 


N{h),fr.B(h),e(h ) 


Ah 


(17) 


(18) 


where 


hk*hr 

n'  ««[*(*),/>.*(*),*(*)] 
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For each  interval  Ah“fy+]-h/  (or  lamination),  the  N(h)  function,  approximated  by  a simple  analytic 
function.  The  number  of  lamination*  required  can  be  minimized  by  choosing  a function  which  approximates  the 
N(h)  distribution  over  large  Ah  increments.  A linear  approximation  is  adequate,  however,  if  small  height  incre- 
ments  are  chosen,  and  this  simpler  representation  will  be  used  to  illustrate  the  procedure.  The  linear  substitution: 


h mh(+  aj+i{N—Nj)  dh  ■ «|+|  dAf 


gives  for  a typical  lamination: 


J’Vl  f^i'+l 

I n’dJV  (19) 

ht  JNi 

The  integral  corresponding  to  a given  lamination  is  thus  completely  defined  except  for  a cor.-Zant  which  repre- 
sents the  value  of  dfc/dJV  in  the  lamination.  For  all  laminations,  except  the  last  one  (h^_j,h*),  the  function  n is 
finite*  and  the  integral  on  the  right-hand  side  of  Eq.  (19)  can  be  evaluated  with  great  accuracy  by  numerical  integra- 
tion techniques,  using,  tor  example,  a gausnan  quadrature  formula  with  only  three  coefficients.  The  last  lamination 
terminates  at  the  reflection  point  where  n'  is  infinite.  The  integration  in  this  case  is  based  on  asymptotic  expres- 
sions for  n near  the  reflection  conditions,  which  are  of  the  form: 

• _ a 

n m—pssssss- 

where  a and  0 are  constants  appropriate  for  the  propagation  mode  used.  By  letting  -(W,  the  last  integral  in 
Eq.(19)  becomes: 


At  the  reflection  point,  the  integrand  n't  is  finite  since  n't  ■ n'  Vl  -f W m «• 

Most  of  the  techniques  currently  used  for  the  inversion  of  Eq.  (1)  are  baaed  on  the  above  lamination  concept. 
The  laminations  are  implicitly  specified  by  the  choice  of  sampling  frequencies,  as  illustrated  by  the  following  exam- 
ple based  upon  the  ordinary  h‘(J)  function  for  a topside  sounding.  Let  /, , . . . ,/R  be  the  sampling  frequencies 
and  Ai.hj hR  be  the  corresponding  h'  values.  Let  N{  be  the  density  at  which  reflection  occurs  for  the  fre- 
quency /t,  and  let  be  the  density  at  the  satellite,  corns  h\*  0 (or  equivalently  hl  » 0). 


•There  is  one  exception  to  this  statement,  and  this  occurs  in  the  case  of  7 echoes  on  topside  ionograms  where  n 
becomes  infinite  at  the  start  of  the  first  lamination  when  X ■ (1  - 7* )/ (1  - Y£  ). 
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Using  the  linear  approximation  of  Eq.  (19),  the  virtual  range  values  can  be  written- 


rNi 

hi  =02  I n'U^dS’ 

Jv. 

rVj  . 

/*V< 

/ij=0i  I it  </,  )dA'  + u. 

J n'(/sU 

A' , 

• • • • • • 
r'2 

h'n=  a 2 | »'U,i  )d.V  + • • 

* + 01/  J 

CO) 


Each  integral  in  Eq.  (20)  can  be  evaluated  using  techniques  previously  discussed:  thus,  the  unknown  a(  terms  can  be 
readily  computed. 

For  uie  extraordinary  ray  the  procedure  is  slightly  more  complicated,  because  knowltdge  of  both  f-  and  2ie 
corresponding  at  the  height  h(  is  required  to  compute  N-  [see  Eq.  (9)].  Tire  procedure  in  this  case  is  to  ini- 
tiate the  calculation  of  each  new  lamination  with  an  estimated  value  of  B and  to  refine  this  estimate  by  an  iteration 
process  [Jackson.  1969a,  pp.  973-975, Lockwood,  1970], 

The  iaminations  commonly  used  for  P'(f)  to  A (h)  inversion  are  of  the  form: 

h =hi  + n, •+!(*-♦,•)  + (-0 


where  ♦ is  n.  /A',  or  InTV.  When  a parabolic  lamination  is  used  (fy+|  ^0).  the  additional  lamination  parameters 
are  obtained  by  assuming  that  dhld'V  is  continuous  at  the  lamination  boundaries,  namely: 


ai+ 1 + -fy+l^i+l  * ♦»>  = ai+2 


(22) 


EXPERIMENTAL  CONSTRAINTS  PLACED  ON  THE  P’{f)  F'JNCTION 


Some  difficulties  arise  in  the  inversion  of  Eq.  (1)  because  the  experimental  data  do  not  yield  the  P\f)  function 
for  the  comp’ete  altitude  range  of  interest.  These  difficulties  are  encountered  primarily  with  ground-based  soundings 
due  to  regions  QR  and  ST  (fig.  1)  fot  which  the  P'(f)  functions  are  not  available.  In  some  cases  on  topside  iono- 
grams,  the  extraordinary  P'(f)  function  is  not  defined  with  sufficient  accuracy  at  the  starting  point  A.  This  usually 
arises  when  the  density  at  the  satellite  is  less  than  200  electrons/cc. 
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Unknown  Start  on  Topside  lonograms 

If  the  density  Nt  at  the  satrilite  altitude  ht  is  unknown,  it  can  be  determined  by  the  following  method.  The 
equations  of  the  first  two  laminations  are  assumed  to  be  lir  ear  in  In.V:  *hat  is, 


and 


h = h,  + a2ln 


N_ 

N\ 


for  h2  <h  <hi 


h - h2  +a3  In 


N 


N2 


for  h3<h<h2 


It  i ; then  assumed  that  the  correct  value  of  Nx  will  yield  a2-a3.  Various  value*  of  Nx  are  tried  (by  an  iteration 
process)  until  a2  and  a3  agree  within  0.1%.  This  slope-matching  technique  has  been  tested  extensively  with  iono- 
grams  where  <V,  could  be  accurately  measured  by  conventional  scaling  techniques.  It  was  found  that  for  ionograms 
wheie  .V,  is  well  defined,  conventional  and  slope-matching  techniques  have  comparable  accuracy.  Results  obtained 
by  means  of  slope  matching  have  cast  considerable  doubt  on  the  determination  of  iV,  using  the  method  introduced 
by  Hagg  ( 1967 J . which  is  based  on  topside  sounder  plasma  resonance  observations  under  conditions  of  low  N\ . This 
method  yields  values  of  Nx  that  are  typically  one-third  of  the  slope-matching  technique  values.  Furthermore,  the 
Hagg  “beat”  values  (so-calied  because  they  are  determined  from  the  observed  beat  frequency  between  the  fH  and  fT 
resonances)  yield  rather  questionable  N(h)  profiles  (fig.  4).  Lockwood  {set  p.  4-16)  and  Colin  [private  communication) 
have  aiso  used  the  slope-matching  technique  (or  similar  procedures)  to  compute  Nt  and  have  reached  similar  conclu- 
sions concerning  the  Hagg  beat. 


Unknown  Starts  on  Grcend-Based  Soundings 
The  P'(D  function  for  a ground-based  sounding  can  be  written: 


Figure  4 - Comparisons  between  two  N(h)  profiles  derived 
from  the  same  kmogram  using  the  Hagg-beat  start  (dashed- 
curve  and  the  slope  matching  start  (solid  curve). 


Figure  5.- Rocket  results  (Ref.  11)  showing  D and  E regions. 
The  lower  part  of  the  profile  (for  which  the  P'(f)  function 
is  not  normally  available)  can  be  approximated  by  the  indi- 
cated straight  segments. 
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where  Q and  P refer  to  figure  l.and  refers  to  a reflection  point  above  point  R.  The  altitude  /i q is  the 
altitude  below  which  n can  be  considered  equal  to  unity  for  the  frequencies  of  interest.  The  group  index  n can 
not  be  considered  equal  to  unity  in  the  region  QR,  even  though  this  region  yields  no  echoes  on  conventional  iono- 
grams.  The  procedure  used  are  different  on  daytime  and  on  nighttime  ionograms  and  these  two  cases  will  oe  discussed 
separately. 

Daytime  ionograms.  One  solution  [Jackson, 

1956,  pp.  1 20  1 22] , which  is  similar  to  the  slope- 
matching technique,  is  to  assume  that  the  region  QR  is 
a simple  analytic  extension  of  the  profile  above  point  R. 

Another  approach,  based  on  rocket  results  (fig.  5),  is  to 
assume  that  the  density  is  1000  elections/cc  at  80  km, 
and  that  it  increases  exponentially  up  to  point  R.  This 
gives  fairly  good  results  with  midlatitude,  midday  iono- 
grams. 

Nighttime  ionograms.  Analytic  extensions  of  the 
profile  above  point  R could  also  be  assumed  for  the  re- 
gion QR,  but  more  reliable  results  can  usually  be  ob- 
tained by  making  use  of  the  ordinary  and  extraordinary 
traces,  which  are  of  comparable  quality  on  nighttime 
ionograms.  The  procedure  is  to  assume  an  arbitrary 
density  value  at  Q and  to  compute  from  the  ordinary 
trace  the  correspond! % N(h)  profile.  The  extraordi- 
nary P\f)  function  corresponding  to  this  profile  is  then 
computed  and  compared  to  the  observed  extraordinary 
P'(f)  function  at  m values  of  sounding  frequencies. 

The  standard  deviation  a between  the  computed  and 
the  observed  extraordinary  P'(f)  values  is 


o=\f£(d,-J/m)  (23) 

where  d(  represents  the  vertical  height  differences  at 
/ = The  process  is  repeated  for  other  assumed  densi- 
ties at  Q,  and  the  density  at  Q that  yields  the  mini-  the  density  is  10s  el/cc. 

mum  value  of  o is  taken  as  the  starting  density.  The 

quantity  a for  typical  nighttime  midlatitude  ionograms  is  shewn  in  figure  6 a function  of  the  assumed  plasma  fre- 
quency at  Q.  A value  of  hn=  150  km  was  used  for  this  calculation.  Also  shown  is  the  aiti'ude  at  which  the  electron 
density  reaches  a value  of  l*5  electrons/cc.  Extensive  tests  of  this  technique  have  given  a high  degree  of  consistency 
in  the  1 50-km  starting  value  obtained  at  one  sounder  location  (Wallops  Island),  namely,  0.2  MHz  < (/vV)  < 0.4  MHz. 
Thus  an  empirical  starting  value  of  fN  = 0.3  MHz  could  be  used  in  cases  when  the  two  traces  cannot  be  scaled  with 
sufficient  accuracy  to  permit  the  above  determination  of  Nq. 

The  Valley  Problem 

The  valley  ambiguity  (unknown  region  ST)  can  in  principle  be  substantially  minimized  by  using  a technique 
similar  to  the  one  described  for  the  unknown  start  on  nighttime  ground-based  soundings.  In  this  case,  a model  of  the 
E valley  is  arbitrarily  selected  and  the  quantity  a [Eq.  (23)]  is  computed  for  various  valley  depths.  The  desired 
valley  is  the  one  that  yields  the  minimum  value  of  a.  The  effect  of  the  assumed  valley  on  the  resulting  N(h)  profile 
is  shown  in  figure  7.  The  shape  of  the  valley  (a  triangular  wedge  on  this  semilogarithmic  plot)  is  one  of  many  \Davis 
and  Saha,  1962;  Mal'tseva,  1969]  assumed  ty  other  authors.  Although  physically  unrealistic,  it  is  convenient  for 
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Figure  6.-  The  function  o versus  plasma  frequency  at  150  km 
for  typical  midlatitude  mghtnme  ionograms.  The  effect  of 
the  starting  point  upon  the  tesulting  N(h ) profile  is  indi- 
cated bv  the  dashed  line  which  shows  the  altitude  at  which 
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ELECTRON  OCNSlTt  lEL/CC) 


Figure  7.-  Effect  of  mumed  valley  upon 
calculated  M(h)  profile* 


FREQUENCY  (MHz) 


Figure  8.- Calculated  X trace*  for  N(h)  profile*  of  figure 
7 and  for  profile*  correspond  ins  to  two  additional  valley* 
(not  ihown  in  fig.  7).  The  observed  X trace  value*  are 
shown  by  the  toiid  points. 

sounder.  These  effects  car.  be  taken  into  consideration 


computer  programming.  Furthermore,  the  shape  of  the 
valley  is  too  variable  to  justify  the  use  oi'  a more  aesthetic 
representation.  The  objective  is  to  improve  the  accuracy 
of  the  profile  above  the  valley  (rather  than  to  provide 
meaningful  data  within  it).  The  profiles  shown  in  figure 
7 would  normally  be  derived  from  the  ordinary  trace. 
These  profiler  and  two  additional  ones  tor  a 90  percent 
and  a 70  percent  valley,  respectively,  have  been  used  to 
compute  the  corresponding  family  of  extraordinary 
traces  above  shown  in  figure  8.  The  solid  points 

representing  the  observed  X trace  are  seen  to  agree  best 
with  a valley  of  80  percent. 

The  valley  determination  illustrated  by  figures  7 
and  8 was  possible  because  the  X trace  was  well  defined 
for  frequencies  immediately  above  the  extraordinary  ray 
E region  critical  frequency  fxE.  Often,  this  require- 
ment is  not  met,  and  inspection  of  figure  8 indicates 
that  the  comparison  would  have  been  much  less  mean- 
ingful had  the  ionogram  not  shown  an  X trace  for 
frequencies  under,  say,  4.S  MHz  (particularly  in  view 
of  the  ±5-km  scaling  uncertainty). 

The  valley  calculators  (when  feasible)  yield  fairly 
consistent  results,  and  these  results  can  be  used  to  esti- 
mate the  valley  depth  in  cases  when  the  extraordinary 
trice  cannot  be  used  for  this  purpose.  For  midlatitude 
midday  conditions,  both  valley  calculations,  as  illustra- 
ted above,  and  rocket  data  [Seddon  and  Jackson,  1958; 
Bauer  and  Jackson,  1963]  show  that  the  minimum  E- 
valley  density  is  typically  80  to  90  percent  of  the  density 
at  fjnax  (point  S of  fig.  1). 

PROBLEMS  ARISING  FROM 
NONVERTICAL  PROPAGATION 


The  routine  procedure  for  the  inversion  of  Eq.  (i) 
is  to  assume  that  the  P'(f)  function  represents  virtual 
heights  measured  at  vertical  incidence.  For  B - 0,  this 
assumptioi.  would  be  equivalent  to  stating  that  the  sounder 
is  r eiving  echoes  from  a spherically  stratified  region  of 
tl  nosphere.  This  region  for  a ground-based  sounder 
would  be  a vertical  cylinder  typically  having  a diameter  of 
10  km.  For  a topside  sounder,  which  an  move  horizon- 
tally up  to  200  ion  while  a complete  ionogram  is  obtained, 
the  spherical  stratification  would  have  to  hold  over  a ver- 
tical region  whose  projection  on  the  earth’s  surface  would 
be  roughly  a 20-  by  200-km  rectangle.  There  is  usually  a 
change  in  altitude  (and  also  some  minor  changes  in  B and 
8)  associated  with  the  horizontal  motion  of  the  topside 
(see  Lockwood’s  discussion,  p.  4-26)  by  assuming  that 


4-12 


the  N(h)  profile  is  a function  of  altitude  only  and  that  the  successive  soundings  oegin  at  a new  reference  altitude 
(and  corresponding  density). 


The  spherical  stratification  concept  usually  applies  to 
the  constant  electron  density  contours,  and  for  B m 0 this 
assumption  would  yield  constant  phase  index  contours.  The 
presence  of  the  terrestrial  magnetic  field  introduces  modifi- 
cations in  the  constant  phase  index  contours,  and  causes  the 
ray  paths  to  be  deviated  from  the  vertical  mostly  near  the 
reflection  points.  Theoretical  investigations  of  this  problem 
[Colin,  private  communication]  indicate  that  no  serious 
errors  are  introduced  in  the  virtual  height  calculations  by 
assuming  that  the  phase  index  contours  are  the  same  as  the 
electron  density  contours.  A more  important  effect  of  the 
magnetic  field  is  the  occasional  presence  of  field-aligned 
irregularities,  which  can  act  a»  wave  guides  and  cause  echoes 
(in  the  extraordinary  mode)  tu  be  field  aligned.  This  effect 
can  be  recognized  by  an  experienced  observer,  and  the  anal- 
ysis (for  4*  =0)  yields  a field-aligned  N(h)  profile.  An  Figure  9.-  Electron  density  contours  derived  from 

example  of  this  type  of  analysis  is  shown  in  figure  9 topside  field-aligned  traces. 

Departures  from  spherical  stratification  can  sometimes  be  inferred  from  a time  sequence  of  ionograms.  In 
the  case  of  ground-based  soundings  it  can  be  detected  (when  severe  tilts  are  present)  from  the  fact  that  multiple 
echoes  (two  or  more  round  trips)  do  not  give  virtual  heights  in  harmonic  relationship  to  the  first  (one  round  trip) 
echo  virtual  height.  Departures  from  spherical  stratification  could  Jso  be  inferred  from  a closely  spaced  network 
of  ground-based  sounders.  Analysis  techniques  that  can  take  into  consideration  these  departures  from  spherical 
stratification  are  currently  under  investigation  (see  McCulky's  discussion,  p.  4-36).  Routine  inversion  techniques, 
however,  assume  that  the  N(h)  distribution  corresponding  to  a given  ionogram  is  a function  of  altitude  only,  and 
that  this  N(h)  distribution  does  not  change  significantly  during  the  10  to  30  sec  required  to  obtain  an  ionogram. 


ACCURACY  OF  THE  INVERSION  TECHNIQUES 


The  accuracy  of  the  mathematical  techniques  used  in  the  inversion  of  Eq.  ( 1 ) has  been  tested  extensively  [Jackson , 

1969a,  pp.  967-970] . These  tests  have  shown  that  the  integration  techniques  do  not  introduce  significant  errors.  For 
example,  on  topside  ionograms  the  error  in  altitude  for  a given  density  is  typically  less  than  1 km.  Some  errors  are 
introduced  in  the  inversion  of  Eq.  (1)  for  ground-based  soundings  due  to  the  d.scontinuities  in  the  P\f)  function  at 
the  starting  point  and  at  the  E valley.  These  discontinuities  introduce  errors  in  altitude  (up  to  10  km)  at  the  starting 
point  and  just  above  the  E valley.  The  errors  above  the  E valley  decrease  monotonically  with  altitude,  and  near 
hmax^2  the  calculated  N(h)  profiles  are  affected  by  the  uncertainties  at  the  lower  altitudes  by  at  most  1 or  2 km. 

Thus,  it  would  be  expected  that  ionograms  obtained  simul>  .neously  by  a ground-baser1  ionosonde  and  by  a 
topside  sounder  directly  overhead  would  yield  N(h)  profiler  tha,  should  agree  near  hmaxF2.  Comparisons  of  this 
type  have  yielded  disagreements  [Jackson,  1969b]  that  are  several  times  greater  than  the  errors  attributable  to  scaling 
accuracy  or  the  mathematical  techniques  used  for  the  inversion  of  Eq.  (1).  The  results  suggest  that  the  error  is  roughly 
proportional  to  the  length  of  the  propagation  path.  Since  the  maximum  propagation  path  (within  the  ionosphere)  is 

usually  several  times  greater  for  a topside  sounding  than  for  a ground-based  sounding,  one  would  attribute  this  discre-  

pancy  primarily  to  the  calculated  topside  N(h)  profile.  On  this  basis,  the  topside  profile  appears  too  low  by  about 
3 to  5%  of  the  distance  d between  the  topside  sounder  and  the  reflection  point.  One  test  for  the  accuracy  of  the  re- 
sulting total  (or  composite)  profile  is  to  calculate  the  corresponding  total  content  f**  Ndh  (where  hs  is  the  satel- 
lite altitude)  and  compare  the  results  with  an  independent  measurement  of  the  total  content.  Comparisons  of  this 
type  using  data  from  GEOS-2  to  compute  $Ndh  have  shown  consistently  that  the  total  content  for  the  calculated 
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composite  profile  was  too  low.  By  raising  the  topside  proale  3 to  4%  of  the  distance  d,  the  two  measurements  of 
fNdh  can  be  brought  into  agreement.  The  small  but  puzzling  discrepancy  between  topside  and  ground-based  sound- 
ing suggests  that  some  cumulative  errors  arise  in  the  observations,  perhaps  due  to  irregularities  in  the  ionosphere  that 
cause  the  propagation  paths  to  deviate  from  the  vertical  direction.  The  discrepancy  does  not  seem  to  be  due  to  large- 
scale  departures  from  spherical  stratification,  as  could  be  inferred  from  the  analysis  of  ionogram  sequences. 

Another  possibility  is  that  cumulative  errors  arise  in  the  analysis  due  to  one  of  the  many  assumptions  made  in 
the  magnetoionic  theory  such  as  cold  plasma  treatment  of  the  ionosphere,  WKB  approximation,  idealized  'flection 
conditions,  and  group  velocity  representation  of  the  signal  velocity.  Although  these  approximations  have  been  accep- 
ted for  several  decades,  a complete  evaluation  of  their  effects  has  not  been  made  in  the  topside  ionosphere. 


SUMMARY 


The  P'if)  to  N(h)  inversion  in  ionospheric  soundings  is  based  on  the  following  assumptions: 

1 . Spherically  stratified  medium  (except  in  special  techniques  under  development  that  make  allowance  for 
gradients). 

2.  Medium  that  does  not  change  during  the  sounding. 

3.  Vertical  propagation  (except  in  more  elaborate  techniques  used,  for  example,  to  deal  with  geomagnetic 
field-aligned  propagation). 

4.  Simple  ray  theory  of  wave  propagation  in  the  ionosphere. 

The  most  accurate  inversion  technique  is  based  on: 

1 . Lamination  concept  and  step-by-step  solution. 

2.  Lamination  model  optimized  for  representative  N(h)  distributions. 

3.  Numerical  integration  with  change  of  variable  to  keep  the  integrand  finite. 

4.  Iteration  (with  topside  extraordinary  P'if)  function)  to  find  the  upper  limit  of  integration. 

Special  problems  arise,  particularly  with  the  P'if)  function  obtained  by  ground-based  soundings, 
when  the  ^(0  function  is  not  defined  over  the  complete  altitude  range  of  interest.  This  leads  to 
(1)  the  starting  point  problem,  and  (2)  the  valley  problem  (for  daytime  grountibased  soundings). 


The  solutions  to  these  special  problems  are  based  on: 

1.  Redundancy  of  the  data  (ordinary  and  extraordinary  P'(f)  functions). 

2.  Estimated  parameter  values  derived  from  statistical  data  accumulated  from  previous  P'if)  to  N(h)  inver- 
sions, or  from  other  techniques,  such  as  in  situ  rocket  measurements. 

3.  In  some  cases,  extrapolations  or  slope-matching  techniques. 

The  accuracy  of  the  N(h)  profiles  derived  by  inversion  of  the  P'if)  function  is  typically  better  than  5%  of 
the  length  of  the  propagation  path  in  the  ionosphere.  The  errors  in  the  Nih)  profiles  are  apparently  due  to  assump- 
tions made  in  the  analysis,  since  they  are  several  times  greater  than  the  error  attributable  to  the  mathematical  tech- 
niques used  in  the  inversion  process.  Although  the  human  element  was  not  stressed  in  this  paper,  it  is  an  important 
factor  in  the  ultimate  accuracy  achieved  with  a given  inversion  technique:  It  enters  into  the  initial  recognition  and 
accurate  scaling  of  the  P'if)  function,  the  evaluation  of  the  validity  of  the  aaumptions  made  for  a given  experimen- 
tal condition,  and  the  final  acceptance  (or  rejection)  of  the  results  given  by  the  inversion  process. 
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ABSTRACT 


A method  for  converting  topside  sounder  ionograms  into  topside  electron  density  profiles  is  discussed.  The 
lamination  method  used  is  modified  to  take  into  account  the  variation  of  electron  density  a id  magnetic  field  within 
each  lamination.  Also  included  is  a change  of  variable  to  produce  a finite  integrand  of  the  integral  involved,  an  itera- 
tion scheme  that  permits  convergence  on  an  initially  unknown  density,  a second  iteration  scheme  to  overcome  the 
problem  of  an  uncertainty  in  the  electron  density  at  the  satellite,  and  a modification  to  compensate  for  the  chang- 
ing satellite  altitude  over  the  duration  of  the  sounding.  Two  applications  of  the  technique  ?..e  discussed:  field- 
aligned  traces  for  computing  field-aligned  profits  and  computer-aided  systems  for  scaling  ionograms. 


INTRODUCTION 


Many  schemes  for  converting  topside  ionograms  into  electron  density  profiles  have  been  devised  during  the 
years  since  the  launch  of  the  first  topside  sounder  in  the  Alouette  1 satellite.  The  most  accurate  and  most  widely 
used  method  has  been  described  in  detail  by  Jackson  { 1969] . The  technique  starts  with  the  simple  concept  of  the 
lamination  method  of  inversion,  which  is  equivalent  to  the  inversion  of  a triangular  matrix.  Then  it  incorporates 
several  complicating  modifications,  all  of  which  are  needed  to  obtain  the  desired  accuracy  of  the  result.  These 
modifications  include  a second-order  logarithmic  model  for  the  variation  of  height  with  electron  density  within 
each  lamination,  an  allowance  for  the  change  in  magnetic  field  v'ithin  each  lamination,  a change  of  variable  to 
keep  the  function  being  integrated  finite,  an  iteration  permitting  convergence  on  an  initially  unknown  limit  of 
integration,  an  iteration  scheme  to  overcome  the  problem  of  an  inaccurate  starting  electron  density,  and  a 
procedure  to  account  for  the  changing  satellite  altitude. 

Here  we  summarize  the  main  points  of  topside  N{h)  reduction  with  emphasis  on  how  our  knowledge  of  the 
ionosphere  has  influenced  the  particular  solution  to  the  problem,  and  discuss  two  special  applications  of  the  tech- 
nique: the  use  of  field-aligned  ionograms  for  computing  N(h)  profiles  and  computer-aided  systems  for  so  ling 
ionograms. 


COMPUTATION  OF  TOPSIDE  N(h)  PROFILES 


Topside  ionograms  [Hagretal.,  1969;  Jackson,  1969)  are  the  range  versus  frequency  records  obtained  from  a 
topside  sounder  [Franklin  and  Maclean , 1969] . Topside  ionograms  have  many  varied  forms  and  features;  the  iono- 
gram  shown  in  figure  1 was  selected  to  illustrate  clearly  the  Z-,  O • and  Jf-wave  traces.  Also  identified  is  the  Z'  trace 
[Calvert,  1966]  and  various  plasma  resonances  [Hogg  etaL,  1969] . Jackson  has  discussed  the  convention  (followed 
here)  of  expressing  echo  time  delay  as  “apparent  range”  in  kilometers. 
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The  Z-.  O,  or  A-wave  trace  may  be  used  independently  to  compute  an  electron  density  (A'(Jr)l  profile, 
although  the  Z-wave  trace,  because  of  its  cutoff,  usually  provides  only  a partial  profile  Figure  2 from  Jackson  (pri- 
vate communication]  shows  N(h)  profiles  computed  from  the  Z-,  0-,  and  A-wave  traces  in  figure  1.  The  profiles 
were  computed  by  the  inversion  of  the  integral 


n[JV(/i),/;fl(h).0(h)]dh 


P (f)  * apparent  range  at  the  wave  frequency  / 
ho  * height  of  the  sounder  (satellite  altitude) 

hr  - height  at  which  reflection  occurs  for 
wave  frequency  / 

n'  = group  refractive  index 

/ = wave  frequency 

B * magnitude  of  the  earth’s  magnetic  field 

0 = angle  between  the  earth’s  magnetic  field- 
line and  the  wave  normal 

N * the  electron  density 

The  problem  of  calculating  N(h)  from  the  curve  p\f) 
scaled  from  a topside  ionogram  is  discussed  below. 

Topside  Lamination  Method 

In  the  basic  lamination  method,  the  integral  is 
broken  up  into  many  parts,  each  being  determined 
explicitly  by  the  points  (p\f)  selected  from  the  iono- 
gram. Initially,  both  ho  and  Na  are  known;  then: 


Figure  1.-  Example  of  an  Alouette  II  ionogram  with 
clear  Z-.  O-.  and  A wave  traces. 
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Figure  2.-  Electron  densities  obtained  from  the  ionogram 
in  figure  1 with  an  independent  N(h)  calculation  for 
each  of  the  Z-.  0-.  and  A-wave  traces.  The  points 
si  town  in  the  graph  were  selected  from  the  computed 


From  Eq.  (2)  one  finds  ft, , from  (3)  one  finds  h2 , and  from  (4)  one  can  obtain  hj.  But  the  evaluation  of 
/ n'(N,f,B.8)dh  is  not  straightforward  since  hj  and  Nj  are  unknown  (although  Nj-\,hj-\,Bj-\  are  known) 
and  both  B and  N change  withir.  the  integral.  Thus,  models  that  define  N and  B within  the  lamination  are 
needed  before  one  can  numerically  determine  the  integral  of  n. 

N(h)  Model 

Since  the  electron  density  in  the  topside  ionosphere  varies  approximately  exponentially  with  height,  a 
suitable  choice  for  the  definition  of  height  with  density  within  the  lamination  between  points  ;-l  and  j is. 


b ~hj-\  + «/  1°8 


bj  log2 


(5) 


If  the  term  in  brackets  is  neglected,  one  has  a “linear  in  log  JV”  model,  which,  though  simple,  has  a discontinuity  at 
the  edge  of  each  lamination.  The  term  in  brackets  is  necessary  if  the  derivative  is  to  be  made  continuous;  the  result 
is  a “parabolic  in  log  N”  model.  The  term 


V 1 


Nj 

ai + 2bi log  FT 

/"I 


(6) 


is  the  “scale  height”  at  the  point  / and  is  derived  from  the  condition  that  the  derivative  be  continuous  at  each  boun- 
dary. This  is  a meaningful  physical  parameter  since  the  scale  height  is  a measure  of  the  electron  and  ion  temperature 
Te  + Tj  and  mean  ionic  mass  M.  For  an  ionosphere  in  diffusive  equilibrium,  the  scale  height  is  given  by: 


"r'V  i 


Te  + Ti 


(7) 


where  k = Boltzmann’s  constant.  This  model  for  electron  density  has  the  advantages  that  dJV/d/i  is  continuous  over 
the  complete  profile;  the  scale  height,  a pertinent  physical  parameter  is  implicit  in  the  model,  and  the  model  is  mono- 
tonic in  N,  as  is  the  electron  density  in  the  topside  ionosphere. 


Magnetic  Field  Model 

The  magnetic  field  at  any  known  height  and  location  can  be  calculated  to  high  accuracy  from  a modern  spheri- 
cal harmonic  model  such  as  the  International  geomagnetic  reference  field  [Grin  and  Cain , 1968] . Thus,  the  field 
Bj- 1 at  the  top  of  the  ;th  lamination  is  known,  and  its  value  at  other  heights  within  the  lamination  can  be  calcu- 
lated to  sufficient  accuracy  by  assuming  an  inverse  cube  variation  with  height.  Jackson  [1969]  has  combined  these 
two  models  in  such  a way  that  the  magnetic  field  is  defined  as  a function  of  electron  density. 


Change  of  Variable 

As  the  wave  propagates  downward  to  its  reflection  height  the  group  velocity  Vg  ot  the  wave  approaches  zero, 
and  since  n'  = c/Vg,  the  integrand  n approaches  infinity.  The  following  change  of  variable  [Shim  and  Wale , 1952; 
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Jackson , 1956]  is  introduced  to  produce  an  integrand  that  is  finite  at  reflection  and  permits  a more  accurate 
numerical  evaluation  of  the  integral: 


f2  = 1 - X for  the  O wave  (81 

y 

r2  = 1 - ■■  ■■  for  the  X wave  (9) 

I-Yr 

where 

X = /JV2//2 
YR  = /%/ 

/Hr  = gyiofrequency  at  the  height  of  reflection 


Numerical  Integration 

Generally,  the  accuracy  of  a numerical  integration  may  be  determined  by  two  methods:  (1)  An  analytic 
evaluation  of  the  integral  (if  possible)  provides  the  correct  value  of  the  integral;  and  (2)  the  number  ci  points  used 
in  the  evaluation  of  the  numerical  integration  can  be  increased  until  no  change  occur  - in  the  result.  Such  comparisons 
[Jackson,  1969]  have  shown  that  provided  the  above  change  of  variable  is  performed,  a three-point  gaussian  numeri- 
cal integration  produces  an  error  of  less  than  0.1%  for  the  2f-'vave,  and  an  error  of  less  than  0.1%  for  the  0-wave, 
provided  0 is  greater  than  20°.  For  the  0 wave,  the  percentage  error  increases  as  0 -*  0,  and  a 16-point  gaussian 
numerical  integration  is  required  to  produce  au  error  less  than  0.1%  when  0 = 5°. 


Iteration 

For  the  lamination  in  which  reflection  occurs,  the  lower  limit  in  height  and  its  gyrofrequency  (or  B)  are 
unknown.  For  the  X wave  the  density  is  unknown  also,  since  it  is  a function  of  both  the  wave  frequency  and  gyro- 
frequency.  One  can  make  some  initial  guess  for  the  gyrofrequency  at  the  bottom  of  the  lamination,  perform  the 
integration  to  determine  the  height  hj,  and  calculate  the  actual  gyrofrequency  at  this  height.  Then  one  repeats  the 
integration  and  continues  the  iteration  until  the  new  value  of  hj  is  the  same  as  the  old  value.  When  the  Jf-wave 
trace  is  scaled,  such  an  iteration  is  absolutely  necessary  since  the  guess  for  fH  determines  the  estimate  for  the  elec- 
tron density  at  the  bottom  of  the  lamination.  Jackson  [1969]  has  shown  that  a useful  first  guess  for  the  gyrofre- 
quency at  reflection  /Hr  is  that  corresponding  to  some  minimum  permissible  increase  in  density  over  that  at  the 
top  of  the  lamination.  For  example, 

A/  (guess)]  = 1.001A/-1  (A,_i  ~/W/-i)  fl°) 

implies  a 0.1%  increase  in  density  at  the  / th  point  over  that  at  the  /-1th  point. 

An  example  of  such  an  iteration  is  shown  in  figure  3.  The  initial  guess  of  / Hr , calculated  from  the  above 
expression,  leads  to  convergence  at  the  desired  height  at  point  E.  Lockwood  [1970]  has  shown  that  the  above 
iteration  will  diverge  foe  a restricted  class  of  ionograms  even  though  a solution  exists.  An  ionogrtm  of  this  class 
is  shown  ir.  figure  4.  The  iteration  for  the  first  lamination  in  this  ionogram  is  shown  in  figure  5.  The  iteration 
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Figure  3.-  An  illustration  of  the  iteration  technique  that  con- 
verges on  the  desired  intersection  point  E from  an  initial 
guess  for  fH^  corresponding  to  a 0.1  percent  increase  in 
density  over  the  density  at  the  top  of  the  lamination. 
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Figure  5.-  An  example  of  an  iteration  (A'B'C'D')  which 
diverges  and  a modified  technique  which  converges  to  the 
desued  point  E,  In  the  modified  technique,  an  initial 
guess  A leads  to  the  points  B,  C,  and  D.  A linear  inter- 
polation using  points  A,  B,C  and  D result  in  the  approx- 
imate height  E.  The  same  steps  are  repeated  using  the 
height  E and  the  actual  gyrofrequency  at  that  height  as 
a new  starting  point. 
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Figure  4.-  An  io nogram  in  which  the  density  at  ths  satellite  is 
small.  The  difference  frequency,  fT  - fH , is  ot  served  as  a 
modulation  on  the  combined  ///,  and  fT  spikes. 
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Figure  6.-  The  N(h)  profile  computed  from  the  scaled  points 
in  the  ionogram  of  figure  4.  The  point  at  2498  km  is  the 
result  of  the  modified  iteration  shown  in  figure  5. 
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Figure  7.-  An  ionogram  which  illustrates  the  broad  fN  spike 
at  low  electron  densities,  and  the  merging  of  the  fxS  with 
fT  and  fH  at  low  densities. 
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A B C D'  diverges  because  the  slope  of  the  height  of  reflection  curve  (versus  assumed  gyrofrequency)  has  a larger 
absolute  value  than  that  of  the  slope  of  the  actual  gyrofrequency-height  curve.  A modification  of  the  iteration, 
which  always  converges  if  a solution  exists,  is  as  follows: 

1.  Perform  the  standard  iteration  for  two  steps  only  to  define  the  points  B,  C,  and  D from  the  starting 
point  A. 

2.  If  the  difference  in  height  between  points  B and  D is  less  than  some  small  value,  the  iteration  is  ter- 
minated and  pr  int  D is  taken  as  the  correct  solution. 

3.  Otherwise,  perform  a linear  interpolation  by  using  the  lines  AC  and  BD  to  find  the  approximate 
intersection  point  E.  This  height  and  the  actual  gyrofrequency  at  this  height  are  then  used  as  the 
coordinates  of  a new  starting  point  in  step  1 . For  this  modified  iteration,  an  initial  guess  for  fH ^ 
corresponding  to  an  increase  in  density  of  10  percent  was  used  since  this  increase  [Jackson,  1969] 
is  typical  of  many  scaled  ionograms,  and  this  guess  permits  a slightly  more  rapid  convergence.  The 
resultant  profile  is  shewn  in  figure  6. 

The  Starting  Density  Problem  (For  Densities  Less  Than  1000  Electrons/cc) 

As  the  electron  density  at  the  satellite  approaches  zero,  the  fxS  approaches  fHS  since  fNS2  = fxS 
( fxS  -fHS).  The  relative  accuracy  to  which  fxS  can  be  scaled  decreases  as  the  density  approaches  zero  since 
fxS  merges  with  the  gyrofrequency  resonance  of  the  lonogram  (see  figure  7).  A starting  density  (or  fxS)  that 
is  too  small  produces  a “knee”  in  the  N(h)  profile,  in  which  the  slope  of  «he  first  lamination  is  smaller  than  the 
average  slope  of  subsequent  laminations.  From  a knowledge  of  the  ionosphere,  one  expects  the  scale  height  (or 
slope)  to  be  constant  or  to  increase  with  height.  Thus,  an  incorrect  starting  density  produces  an  obvious  error  in  the 
N{h)  profile. 

Some  possible  schemes  to  circumvent  this  problem  are: 

1.  The  plasma  frequency  fN  can  be  scaled  to  determine  the  Parting  density  i Jackson , 1967  j . This  could 
provide  a somewhat  more  accurat ; measurement  of  de  ty  but  has  two  disadvantages.  As  shown  in 
figure  7 the  fN  spike  on  the  ionogram  is  usually  broad  in  frequency  with  a corresponding  uncertainty 
in  its  nominal  center  frequency.  Also,  since  current  topside  sounders  begin  at  100  kHz,  the  minimum 
observable  plasma  frequency  is  100  kHz,  corresponding  to  an  electron  dens  ty  of  124/cc. 

2.  Hagg  [1967]  h3S  shown  that  as  the  electron  density  approaches  zero,  the  fH  and  fT  resonances  can 
be  simultaneously  excited  by  the  sounder  transmitter  pulse.  The  difference  frequency  between  the  two 
resonances  is  observed  on  the  ionogram  as  a modulation  on  the  single  spike  produced  by  the  two  reso- 
nances, an  example  of  which  is  shown  in  figure  4.  An  electron  number  density  can  be  calculated  by 
assuming  that  each  resonance  occurs  at  its  nominal  frequency.  Then: 


fN 7 = 2 fH  bf 


(11) 


where  A/=  beat  frequency.  However,  Lockwood  and  Jackson  [private  communication]  have  found  that 
the  starting  density  calculated  by  this  method  is  too  small,  since  it  produces  an  N\h)  profile  whose  first 
scale  height  is  smaller  than  the  second.  The  staiting  density  must  be  increased  by  a factor  of  2 or  3 to 
produce  an  N(h)  profile  in  which  the  fi.^t  s^de  height  is  at  it  as*.  ;s  large  as  the  second.  The  error  in  the 
calculation  of  fN  by  this  method  may  be  caused  by  the  nppaient  shift  of  the  fH  and  fT  resonances 
from  their  nominal  frequencies  [Btnson,  1969;  Bitoun  elal.,  1970] ; on  the  other  bind,  the  calculated 
value  may  represent  the  actual  density  in  the  proximity  of  the  satellite  and  not  me  average  density  at  that 
altitude.  Nevertheless,  until  the  fH  and  fT  resonances  are  better  understood,  this  method  cannot  be 
used  to  calculate  the  starting  density  for  N(h)  reduction. 
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3.  Since  t.ie  scale  height  is  expected  to  be  constant  or  <o  increase  with  altitude,  one  could  try  several  values 
of  fxS  and  select  the  resulting  profile  that  appears  to  be  correct  [Jackson,  1967] . More  specifically, 
for  an  isothermal  ionosphere  in  diffusive  equilibrium  [Eq.  (7)]  the  scale  height  at  different  altitudes  will 
be  the  same  except  for  the  variation  due  to  the  changing  gravitational  field.  The  geopotential  scale  height 


"/ 


// 

/ 


(12) 


in  which  the  gravitational  change  is  removed,  will  not  change  with  altitude  under  the  above  conditions.  For 
a realistic  N{h)  profile,  the  geopotential  scale  height,  H\  at  the  bottom  of  the  first  lamination  should  be 
equal  to  the  geopotential  scale  height,  at  the  bottom  of  the  second  lamination.  Thus,  one  could  per- 
form the  A'(Jt)  calculation  for  the  first  and  second  laminations  only,  and  then  test  to  see  whether 
|//|  - H'i | <e.  If  the  inequality  is  satisfied  then  the  N(h)  calculation  is  continued.  Otherwise,  if  H\ 

;s  less  than  H'j , N0  is  increased  by  some  increment  and  the  calculation  is  started  again,  if  H\  is  greater 
than  H'i . N0  is  decreased  by  some  incmment  and  the  calculation  started  again.  This  technique  need  not 
be  limited  in  use  to  low  electron  densities:  it  could  be  used  for  any  ionogram  in  which  the  accuracy  ol 
scaling  fxS  or  fS  is  poor.  At  CRC  the  iteration  is  performed  if  A'0  is  less  than  1000/cm3,  with 
e = 50  km. 


Changing  Altitude  of  the  Sounder 

For  topside  sounders  that  are  not  in  a circular  orbit  (Alouette  11  and  ISlb-i ),  the  change  in  the  satellite  altitude 
during  the  measurement  of  a single  ionogram  can  be  of  importance  when  high  accuracy  is  required  [Eccles,  private 
communication] . Typically,  for  these  satellites,  the  maximum  vertical  velocity  is  about  1 km/sec.  Thus,  an  error 
of  up  to  10  km  is  introduced  by  not  taking  in'o  account  the  changing  altitude  of  the  sounder  over  the  duration  of 
the  trace  on  the  ■ onovram.  Often,  this  error  is  no  more  significant  than  the  uncertainties  resulting  from  the  scaling 
of  the  ionogram.  But  for  a well-defined  ionogram,  the  accuracy  of  the  N(h)  profile  is  improved  by  the  correction 
described  below. 

The  change  in  the  satellite  altitude  is 


Ah  = A f 


dhsldt 
d/7  d/ 


(13) 


where  Af  ~ fj  -f0.  The  modified  starting  height  for  the  profile  is 


hu  = ho  + Ah 


(14) 


The  density  at  this  height 


N',,  = N0  exp 


is  derive^  from  an  extrapolation  of  the  first  lamination  (h  =*  h0  + a,  log  N/N0 ) to  h'0.  The  point  (h'0 , N‘0) 
instead  of  the  point  ( hQ , Na ) is  used  as  a limit  of  the  first  integral  in  Eq.  (4). 
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SPECIAL  APPLICATIONS 


Field  Aligned  N(h)  Reduction 

Several  workers  including  Colin,  Hojo,  Jackson  (private  communications]  and  this  author  have  shown  that  if 
one  uses  the  Af-wave  field-aligned  trace  to  compute  an  electron  density  profile  along  a magnetic  field  line,  one  ob- 
tains densities  that  agree  with  densities  from  vertical  topside  profiles  at  locations  where  the  two  profiles  intersect 
Thus,  the  capability  exists  for  mapping  the  electron  density  in  the  direction  of  the  magnetic  field,  as  well  as  the 
vertical  direction. 

The  modifications  that  are  required  to  change  the  vertical  N(h)  reduction  program  to  a field-aligned  N(h) 
reduction  program  are: 

1.  The  angle  0 :s  changed  from  90°-DlP  for  vertical  propagation  to  0 = 0°  for  field-aligned  propagation. 

2.  For  each  lamination,  the  increment  (A S - Ah.  = hj  - fy-j)  is  treated  as  a step  in  the  direction  of  the 
earth's  magnetic  field  instead  of  the  vertical  direction. 

Muldrew  [1963]  has  performed  field-aligned  ray  tracing  by  using  an  electron  density  model  containing  a field- 
aligned  discontinuity;  he  showed  that  as  the  energy  propagated  along  the  field,  the  angle  0 remained  less  than  10° 
except  at  reflection.  In  the  use  of  the  field-aligned  N{h)  reduction  program,  no  changes  result  in  the  field-aligned 
piofile  as  0 is  changed  from  0°  to  10°.  This  justifies  the  use  of  the  value  0 - 0 in  computing  field-aligned  profiles. 


A Semiautomatic  Scaling  System 

An  ionogratr,  contains  more  information  than  the  minimum  required  to  compute  an  N(h)  profile.  In  principle 
either  the  O-  or  the  Af-wave  trace  could  be  used  to  calculate  the  N(h)  profile.  The  other  trace  could  then  be  used 
to  check  the  validity  of  the  interpretation  and  scaling  of  the  ionogram.  Lockwood  [1969]  has  described  such  a 
computer-assisted  scaling  system  in  which  an  operator,  in  exjunction  with  a computer,  performs  the  scaling  of  the 
Af-wavc  trace  of  an  ionogram  and  the  calculation  of  the  N(h)  profile  and  corresponding  O-wave  trace,  all  in  real 
time.  A difference  between  the  calculated  and  actual  O-wave  trace  on  the  ionogram  woulu  indicate  an  error  in 
scaling  or  a misinterpretation  of  ;iie  traces  on  the  ionogram.  The  operator  can  rescale  the  ionogram  until  a satistac- 
toiy  agreement  is  obtained  between  the  actual  and  calculated  O-wave  traces.  Figure  8 shows  an  incorrect  scaling 
and  figure  9,  a correct  scaling  of  an  ionogram  and  its  corresponding  computed  O-wave  trace.  In  figure  8 the  error 
in  the  scaled  Af-wave  trace  at  1 .6  MHz  produces  a corresponding  error  in  the  O-wave  trace  at  1 .0  MHz. 

More  recently,  both  Lockwood  ana  Colin  [private  communication]  have  developed  similar,  more  sophisticated 
semiautomatic  systems,  with  more  features  to  aid  the  operator  in  the  interpretation  of  the  ionogmm.  For  example, 
at  hit1’,  latitudes,  the  vertical  reflection  traces  are  often  masked  by  spread  echoes.  However,  by  redisplaying  the  iono- 
gram on  the  CRT  at  a higher  amplitude  threshold  level,  the  spread  is  removed  from  the  displa”  (but  not  from  the 
original  data)  and  the  reflection  traces  become  clearly  visible,  as  shown  in  figures  10  through  12.  Figure  10  shows  a 
conventional  ionogram  with  severe  spread  echoes.  The  same  ionogram,  as  displayed  on  the  computer  scaling  system 
at  CRC  is  shewn  in  figure  1 1 . Figure  1 2 shows  the  digital  ionogram  redisplayed  at  a higher  threshold  level;  in  it  most 
of  the  spread  echoes  are  removed  and  the  main  traces  are  discernible. 

A trained  operator  now  performs  the  identification  of  the  traces  in  the  ionogram  and  scales  the  appropriate 
trace.  The  operator  is  better  suited  than  a computer  to  perform  these  tasks  since  they  involve  subjective  decisions 
in  pattern  recognition.  On  the  other  hand,  a computer  could  determine  the  range  and  frequency  coordinates  of  a 
tiace  (i.e.,  scale  it)  if  the  tiace  were  identified  to  the  computer.  Ir,  the  computer-aided  system  at  CRC,  the  computer 
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Figure  8.  A digitized  »o nogram  showing  the  operator's  I ^ure  The  same  ionogram  with  a revises'  scaled  .t-wase 

trace,  and  the  calculated  l/-wav  ace.  tract  and  its  corresponding  calculated  O-uave  trace. 

The  ops.rator's  error  m scaling  al  1 .6  Mlu  produces  a:, 
error  in  the  calculated  O- wave  trace  at  1 .0  MHz. 


Figure  10.  A high-latitude  Alooette  II  lOt'Ogram 
exhibiting  sesere  spread  echoes. 


Figure  11.-  The  same  ionognm.  displayed  by  the  compulct- 
uded  scaling  system,  with  a display  IhreshnU  level  cquiva- 
lent  to  that  in  the  ionogram  in  Figure  10. 


Figure  12.  The  same  digitized  ionogram  disp.as  ed  a.  a 
higher  threshold  level  to  remove  most  of  the  spread  cchi-es 
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determines  the  center  of  the  trace  by  making  use  of  the  amplitude  information  in  the  ionogram.  Hovever, 
computer  identification  of  the  traces  has  not  yet  been  attempted  because  of  the  complexity  of  the  pro*,  tern. 

Although  computer  identification  of  patterns  has  been  applied  successfully  other  disciplines  (the  analysis 
of  cloud  chamber  photographs  and  chromosome  identification)  the  task  is  more  difficult  in  the  analysis  of  topside 
sounder  ionograms  for  several  reasons.  The  signal-to-noise  ratio  is  variable  and  often  poor.  The  traces  are  often  in- 
complete and  fragmentary.  There  are  no  distinct  classes  of  ionograms;  instead,  there  is  a continuum  of  ionograms 
from  one  “type”  to  another.  Furthermore,  the  identification  of  the  vertical  traces  is  often  complicated  by  more 
prominent  oblique  and  field-aligned  traces.  Thus,  the  computet  identification  of  traces  in  topside  ionogram  presents 
a challenge  that  has  not  yet  been  met. 


SUMMARY 


This  paper  has  illustrated  how  our  knowledge  of  ionospheric  physics  has  aided  the  development  of  a mathe- 
matical method  tor  converting  topside  ionograms  into  electron  density  profiles.  The  scheme  makes  use  of  the  fact 
that  the  electron  density  in  the  topside  ionosphere  varies  exponentially  with  height  with  a slowly  changing  scale 
height.  A spherical  harmonic  model  is  used  to  determine  accurate!)  the  earth’s  magnetic  field  components  at  lami- 
nation boundaries,  anc  an  inverse  cube  variation  in  B is  assumed  within  each  lamination.  Then,  an  iteration  based  on 
the  constancy  of  geopotential  scale  height  over  a small  height  range  is  used  to  overcome  an  error  caused  by  an  uncer- 
tainty in  the  starting  density. 

Two  special  applications  of  this  inversion  scheme  have  been  discussed:  computing  an  electron  density  profile 
along  a magnetic  field  line,  and  a semiautomatic  scaling  system  in  which  redundant  information  in  the  ionogram  is 
used  to  verify  the  interpretation  and  scaling  of  the  ionogram.  With  regard  to  the  second  application,  the  computer 
identification  of  traces  in  topside  ionograms  remains  a challenging  problem. 


4-24 


DISCUSSION 


Wright:  Both  you  and  Dr.  Jackson  referred  to  obtaining  profiles  from  field-aligned  traces.  How  do  you  recognize 

when  this  occurs? 

Lockwood:  Sometimes  you  see  both  vertical  and  field-aligned  traces  on  the  topside  ionogram,  particularly  on  the 

extraordinary  trace. 

Wright:  Aren’t  there  imci  mediate  cases  where  there  is  only  one  echo  of  a kind,  perhaps  partly  guided? 

Lockwood:  Yes,  but  you  can  normally  recognize  that.  Sometimes  at  high  latitudes  you  will  see  only  the  field- 

aligned  traces. 

Wright:  Concerning  the  question  of  correcting  geomagnetic  field  with  altitude,  you  showed  a rather  elaborate  itera- 

tion scheme  that  seemed  to  be  concerned  with  removing  an  error  of  the  order  of  3 km  in  3000  km  of  real  height. 
In  comparison  with  other  errors  <lo  you  believe  this  is  justified? 

Lockwood:  Yes,  because  that’s  an  error  in  each  lamination  and  it  is  cumulative. 

Jackson:  In  the  analysis  of  ground-based  soundings  you  wouldn’t  be  concerned  with  this. 

Croft:  Are  there  any  plans  for  putting  a sounder  in  a geostationary  satellite? 

Lockwood:  We  would  like  to.  We  have  proposed  both  a magnetosphere  sounder  and  a sounder  in  a geostationary 

orbit. 

Grossi:  Concerning  mirroring  echoes,  the  topside  sounder  shows  them  to  be  absent  above  6.5  MHz.  On  the  contrary 

on  the  ground  a recent  experiment  shows  conjugate  point  echoes  up  to  1 2 MHz.  How  stable  is  the  PRF  of  the 
Alouette  and  ISIS  sounders,  such  that  we  could  do  some  time  integration  from  period  to  period? 

Lockwood:  I don’t  have  a number  for  that,  but  it  should  be  quite  stable. 

Grossi:  Could  you  integrate  coherently  for  minutes? 

Lockwood:  No,  I don’t  think  so  for  two  reasons:  The  sounder  is  sweeping  at  the  same  time;  and  we  are  getting 

the  echoes  because  of  irregularities  aligned  along  the  field,  and  the  satellite  passes  through  the  center  of  a 
duct  and  will  move  out  in  time. 

Grossi:  Sometimes  there  are  large  sheik,  however. 

Unidentified  speaker:  Combining  the  topside  and  ground-based  sounding,  could  you  measure  the  valley  between 

E and  F region? 

Lockwood:  I don’t  think  we  would  be  sensitive  enough  to  look  at  anything  but  gross  features  using  the  ground 

echoes. 

Unidentified  speaker:  Have  you  compared  topside  sounding  with  ground-based  incoherent  scatter? 

Lockwood:  The  incoherent  scatter,  in  general,  gives  comparable  data.  These  backscatter  sounder  do  require 

sounder  da'a  to  calibrate  the  F region  peak  density. 

Jackson:  Cohen,  at  Boulder,  made  20  comparisons  of  Jicamarca  data  with  simultaneous  Alouette  passes,  and 

there  was  a systematic  difference  between  the  two,  indicating  that  the  Alouette  profile  was  lower,  the  same 
sort  of  thing  I mentioned  before. 
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^7  3^  11604  abstract 

Several  alternative  methods  for  solving  the  group  height  equation  are  presented.  Three  of  these  are  now  in 
operation  at  Ames  Research  Center  and  use  data  contained  in  a single  ionogram  trace.  From  the  data  an  electron 
density  profile  N(h)  is  computed.  If  the  ionogram  also  exhibits  other  traces,  “reverse"  ionogram  traces  are  com- 
puted, using  the  N(h)  profile,  for  comparison  with  the  redundant  data.  When  agreement  is  poor,  the  initial  data 
trace  is  reinterpreted,  another  N(h)  profile  computed,  and  the  reverse  traces  generated  once  again.  This  process 
is  repeated  until  a desired  degree  of  consistency  is  achieved. 

To  reduce  the  necessity  for  human  intervention  and  eliminate  decision  making  required  in  conjunction  with 
the  preceding  methods,  a method  is  proposed  that  accepts  as  input, all  data  from  a single  ionogram.  In  general,  no 
electron  density  function  will  satisfy  these  data  exactly,  but  a “best”  N(h)  profile  can  be  computed. 

Finally,  a method  is  described  that  eliminates  the  need  to  assume  that  the  ionosphere  is  spherically  stratified. 
Horizontal  gradients  in  electron  density  are  detected  and  accounted  for  by  processing  several  ionograms  from  the 
same  satellite  pass  simultaneously.  This  idea  is  derived  as  an  extension  of  one  of  the  basic  methods. 


INTRODUCTION 


Sounder  ionogram  data  b.  (/)  can  be  converted  to  electron  density  versus  altitude  profiles  by  inverting  the 
group  height  equation 


*'(/)  = 


n\X,  Y,<p)dh 


(1) 


where 

h'  = virtual  height  or  depth  (km) 

/ = sounder  frequency  (MHz) 
h,  = height  of  sounder  (km) 
h = height  of  wave  reflection  (km) 
n'  = group  refractive  index  [Davies,  1965] 
X = (fN/f)1  = iV/(12400/J) 
fN  - electron  plasma  frequency  (MHz) 

N = electron  density  (electrons/ cm3) 
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Y = /////  = 2.8B// 
j~H  = electron  gyrofrequency  (MHz) 

B = earth’s  magnetic  field  strength  (gauss)  [Grin  and  Cain,  1968] 

<t>  = angle  between  magnetic  field  vector  [Cain  and  Cain,  1968]  and  direction  of  signal  propagation  (degs.) 

Except  for  electron  density  N,  which  is  assumed  to  be  a monotonic  decreasing  function  of  altitude  h,  all  the  above 
variables  can  be  considered  as  input  to  the  problem. 

Generally,  the  inversion  problem  is  attacked  by  modeling  the  N(k)  profile  as  an  analytic  function  depending 
on  several  free  parameters.  These  parameters  are  then  adjusted  iteratively  to  satisfy  both  the  reflection  condition 


| 1 - ■'f  X trace 

XT  - I 1 0 trace 

( 1 + Yr  Z trace 


(2) 


and  the  group  height  equation  for  each  of  a discrete  set  of  data  points  ( h'.f ). 

All  reduction  methods  discussed  here  take  advantage  of  the  essential  “triangularity”  of  Eq.  (1 ) to  compute 
only  a portion  of  the  A(/i)  profile  at  each  step.  This  is  done  by  dividing  the  ionosphere  into  a number  of  spherical 
layers  and  relating  altitude  and  electron  density  within  each  of  these  laminations  by 


* = *m 


+ a,-  In 


+ 


(3) 


Coefficients  ay  and  by  are  then  computed  for  each  lamination  in  turn. 

Although  each  layer  model  has  two  parameters,  only  one  is  left  free.  In  the  topmos*  lamination,  b2  is  set  to 
0 so  that  only  at  is  free.  In  subsequent  laminations,  ay  are  computed  by 


ay  = ay_j  + 2f»y_i  In 


Xi- 


7-1 


Xi- 
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(4) 


leaving  by  free.  Equation  (4)  provides  for  continuity  in  the  first  derivative  of  N(h)  at  the  boundaries  by.  [Note 
that  continuity  of  N(h)  itself  is  implicit  in  the  model  given  by  Eq.  (3).] 

Jackson  was  the  first  to  present  in  detail  a method  for  solving  this  problem  [Jackson,  1969a] . However,  the 
iteration  scheme  used  is  open  ended  (not  constrained)  and  as  a result,  convergence  is  not  always  obtained.  More 
recently,  Lockwood  [1970]  has  improved  the  iteration  method,  but  convergence  still  depends  on  the  accuracy  of 
an  initial  estimate  to  the  solution. 

One  of  the  primary  considerations  in  development  of  the  Amts  direct,  inverse,  and  field-aligned  reduction 
methods  was  elimination  of  uncertainties  in  both  accuracy  and  convergence  in  the  calculations.  As  a result,  conver- 
gence within  a specified  error  tolerance  can  be  guaranteed,  if  only  a solution  exists. 


I 
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Each  of  these  basic  methods  operates  only  with  data  from  a single  trace  ( X,  0,  or  Z)  of  a single  ionogram. 
The  direct  and  inverse  methods  assume  a vertical  (radial)  ray  path  and  are  distinguished  from  each  other  only  by  a 
difference  in  control  variable.  Wave  propagation  is  assumed  to  be  directed  along  the  earth’s  magnetic  field  lines  in 
the  field-aligned  method. 

Inverse  processing  is  extended  to  cope  with  the  problem  occurring  when  significant  horizontal  (latitudinal  or 
longitudinal)  gradients  exist  in  N(h)  - that  is,  when  the  ionosphere  cannot  be  accurately  represented  by  a layer 
model.  This  s->me  basic  method  is  also  shown  to  have  application  in  computing  a “best”  N(h)  profile  for  all  data 
from  a single  ionogram. 


THE  DIRECT  AND  FIELD-ALIGNED  METHODS 


The  Direct  Method 

A set  of  points  (hj,  fj)  is  chosen  to  represent  the  ionogram  trace.  Reflection  heights  hj  corresponding  to  the 
signal  frequencies  fj  are  taken  to  be  the  lamination  boundaries  and  as  such  are  a natural  control  variable  for  the  itera 
tion  process.  The  direct  reduction  method  computes  these  boundaries. 

It  should  be  observed  that  with  this  one  to  one  correspondence  between  data  points  and  lamination  boundaries, 
data  selection  has  a direct  effect  on  the  accuracy  provided  by  the  tKh)  model.  Too  few  data  points  can  give  a poor 
result  regardles  of  how  smooth  the  ionogram  trace  may  be. 

Reflection  heights  hj  are  found  by  solving  for  the  root  of  the  group  height  function 

/**• 

F(hj)  = h'(fj)~  \ n'(X,  Y,<t>)dh  M 

hi 

This  is  where  the  direct  method  differs  from  the  iteration  schemes  used  by  Jackson  and  Lockwood. 

This  solution  process  is  necessarily  iterative,  requiring  that  the  function  F be  evaluated  several  times  for 
successively  more  refined  estimates  of  hj.  Since  fj  is  specified,  an  estimate  of  hj  determines  a value  for  F 
according  to  the  following  sequence  of  operations: 


Calculation 

Source 

"re- 

the magnetic  field  parameters  are  known 

compute  <jij 

the  magnetic  field  parameters  are  known 

Yrxi 

the  reflection  condition,  Eq.  (2) 

compute  aj 

Eq.  (4)  (computed  only  once  and  only  for 

hj.Xj^bj 

Eq.  (3)  (parabolic  laminations) 

~*ai 

Eq.  (3)  (linear  laminations) 

compute  F(hj) 

Eq.  (5) 
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Figure  1.-  Direct  Method  - N(h)  laminations  for  four 
reflection  heights  hj+,.  Curve  3 corresponds  to  the 


Observe  that  in  Eq.  (5),  only  the  integral  over  the  interval 
(hi,  hj _j)  need  be  recomputed  as  hj  is  varied. 

There  are  many  available  processes  by  which  the 
value  of  h:  can  be  refined  toward  the  root  of  the  func- 
tion F.  Tne  one  used  here  is  a modified  version  of  the 
Regula  Falsi  method  [Gsfrovvsfci.  1960] . Upper  and 
lower  bounds  onhj  are  determined.  In'erntsdi#**  values 
can  then  be  selected  in  such  a way  as  to  provide  a sequence 
of  sets  of  bounding  values  that  are  successively  more  restric- 
tive. When  the  root  can  be  bounded  initially,  convergence 
can  be  guaranteed. 


maximum  allowable  value  since  N(h)  must  be  single 
valued. 


Since  F cannot  be  evaluated  for  an  arbitrarily  speci- 
fied value  of  h -,  the  initial  boundary  process  must  be  per- 
formed carefully.  If  hj  is  chosen  too  large,  the  parabolic  N(h)  model  will  be  forced  to  take  a minimum  in  the  h 
direction  as  shown  by  curve  4 in  figure  1 . This  does  not  physically  occur  in  the  topside  ionosphere  and  will  be  reflected 
by  an  imaginary  value  of  ri  when  evaluated  at  points  within  the  lamination. 


An  upper  limit  for  hj  is  found  by  solving  dh/d(lnA  ) = 0 where  h is  given  by  Eq.  (3).  Calculation  of  this 
maximum  involves  solution  of  a system  of  equations  since  .he  reflection  condition  and  magnetic  field  relation  must 
also  be  satisfied.  Curve  3 in  figure  1 represents  the  model  profile  corresponding  to  this  marginal  solution.  Curves  1 
and  2 represent  possible  acceptable  solutions. 


For  a discussion  of  the  lower  limit  on  hj,  the  ionogram 
traces  must  be  considered  separately 


Figure  Direct  Method  - Behavior  of  2f-(l  + V)  for 
three  reflection  heights  (Z  -trace  calculations 
only).  Curve  2 corresponds  to  the  minimum  allowable 
value  since  AT  — ( 1 V ) must  be  negative. 


X trace:  The  reflection  condition  X - 1 - Y 

provides  an  upper  limit  for  Yj  = 1 - Xj_  | 
The  minimum  value  of  hj,  then,  is  the 
value  corresponding  to  the  magnetic  field 
strength  given  by  max  Yj. 

0 trace:  No  physical  limit  can  be  inferred.  An 

arbitrary  value  of  10  km  is  prescribed. 

Z trace:  The  quantity  X - ( 1 + Y ) must  remain 

negative  throughout  the  lamination  if 
Z wave  propagation  is  to  be  allowed - 
Curve  3 in  figure  2 corresponds  to  a 
physically  impossible  solution.  Curve  2 
represents  the  marginal  case  correspond- 
ing to  the  solution  of  d ( X - ( 1 + Y )]  / 
dh  = 0.  Curve  1 corresponds  to  a possi- 
ble acceptable  solution.  As  before,  the 
reflection  condition  and  the  magnetic 
field  relation  are  imposed. 


Limits  computed  in  this  way  will  normally  bracket  the  root  hj  to  be  computed;  for  example,  the  sign  of  the 
function  F will  be  different  for  evaluations  at  the  upper  and  lower  limits.  However,  if  no  sign  change  is  observed 
then  no  root  exists,  and  it  becomes  pointless  to  initiate  the  iteration  process.  Furthermore,  since  the  lack  of  a root 
generally  indicates  an  error  in  the  data,  the  value  of  F obtained  (hj  is  only  a reference  level  to  this  value)  indicates 
the  correction  to  the  data  which  is  necessary  so  that  a solution  to  Eq.  (5)  does  exist. 
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The  Field-Aligned  Method 

In  most  cases,  the  results  of  data  reduction  as  performed  assuming  vertical  signal  propagation,  are  consistent 
with  the  known  physical  s tuation.  Sometimes,  however,  a vertical  assumption  is  obviously  invalid  and  the  most 
likely  alternative  is  that  the  signal  has  propagated  along  “tubes”  aligned  with  the  earth’s  magnetic  field. 

A field-aligned  method  is  designed  for  data  reduction  when  such  is  assumed.  In  concept  this  method  is  iden- 
tical to  the  direct  one,  and  requires  only  that  h and  h'  be  replaced  by  distances  p and  p , along  the  field  line 
discussed  above,  and  that  the  calculations  be  performed  along  that  field  line. 


THE  INVERSE  METHOD 


Inverse  processing  is  so  designated  because  instead  of  specifying  frequencies  and  computing  reflection  heights 
as  with  the  direct  method,  one  solves  the  inverse  problem.  Frequencies  are  determined  that  correspond  to  reflection 
at  a specified  set  of  altitudes.  Inverse  processing  is  also  discussed  by  Madsen,  [ 1970s) . 

One  advantage  of  this  approach  is  that  the  approximation  error  inherent  in  the  N(h)  model  can  be  moie 
easily  controlled.  More  significant,  however,  are  the  sophisticated  extensions  of  the  method  that  are  possible. 

The  ideas  that  motivated  the  calculation  process  of  the  direct  method  also  prevail  here.  A group  height 
function 


G(fj)  = 


n\X,  Y,<l>)<lh 


(6) 


is  defined,  and  the  root  fj  found  by  iteration.  The  following  sequence  of  operations  indicates  how  the  function  G 
is  evaluated  for  a given  estimate  of  f-. 

Calculation  Source 


compute  Bj,  4>j 


/.-*  Y- 
Jl  1 


Y ■ -*  X- 

I i 


the  magnetic  field  parameters  are  known  (computed  only  once) 

Yj  = 2.8  Bj/jj 

the  reflection  condition,  Eq.  (2) 


compute  aj 

hrxrbi 


frh} 


Eq.  (4)  (computed  only  once  and  only  for  parabolic  laminations) 

Eq.  (3)  (parabolic  laminations) 

Eq.  (3)  (linear  laminations) 

An  initial  set  of  data  points  ( h',f ) is  “curve-fit”,  and  hj  corresponding  to  fj  is  found 
by  interpolation. 


compute  G(fj)  Eq.  (6) 
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F.g'ire  3.-  Inverse  Method  - N(h)  laminations  for 
four  frequencies  reflecting  at  hy+j.  Curve  3 corre- 
sponds to  the  maximum  allowable  frequency  since 
A'(/t)  must  be  single  valued. 


As  before,  the  bounds  are  obtained  from  the 
N(h)  model  and  the  reflection  condition.  Curve  4 in 
figure  3 corresponds  to  a physically  impossible  solution. 
An  upper  bound  on  f:  is  found  by  solving  dft/d(ln30  = 0 
in  conjunction  with  the  proper  reflection  condition  and 
the  magnetic  field  relation.  Curve  3 in  figure  3 represents 
the  maximum  attainable  electron  density  and  thus  by  the 
reflection  condition,  a maximum  value  for  fj.  Curves  1 
and  2 correspond  to  possible  acceptable  solutions.  In 
this  case,  the  maximum  can  be  directly  evaluated.  It  is 
not  necessary  to  solve  a system  of  equations. 

Computation  of  a lower  limit  again  depends  on  the 
trace  from  which  the  calculations  are  performed: 

X trace:  The  reflection  condition  is  solved  for 

min  fj  with  fH  - fH , and 
fN  -fNj-\ 

0 trace:  min/y=/y_j 

Z trace:  the  quantity  2f-(l  + Y)  must  remain 

negative  throughout  the  lamination. 
Curve  3 in  figure  4 corresponds  to  a 
physically  impossible  solution.  Curve  2 
represents  the  marginal  case  correspond- 
ing to  d[JV  — 1.1  + Y)]/dh  =0.  Curve  1 
corresponds  to  a possible  acceptable 
solution. 


Figure  4.-  Inverse  Method  - Behavior  of  AT— (1  <-  Y ) 
for  three  reflection  frequencies  (2-trace  calculations 
only).  Curve  2 corresponds  to  the  minimum  allow- 
able frequency  since  Jf— (1  + K)  must  be  negative 
at  every  point  within  the  lamination. 


Although  the  inverse  method  does  offer  some  im- 
portant i dvantages  over  the  direct  method,  it  is  inherently 
more  time  consuming.  This  follows  primarily  from  the 
necessity  for  computing  the  integral  in  Eq.  (6)  over  the 


entire  altitude  range  for  each  trial  value  of  fj . An  added  complication,  though  not  as  severe,  is  in  providing  the  cap- 
ability to  numerically  determine  a data  value  hj  for  any  specified  fj. 


THE  OPTIMAL  LAMINATION  METHOD 


This  method  attempts  to  profitably  use,  simultaneously,  all  the  available  ionogram  data  pertinent  to  the  cal- 
culation of  a single  lamination  of  the  N(h)  profile.  At  least  one  of  the  ionogram  traces  must  be  visible  beginning 
at  zero  virtual  depth.  The  inverse  method  then  procedes  as  usual  with  the  exception  that  instead  of  searching  for 
the  root  of  the  group  height  function  G [Eq.  (6)) , one  looks  for  the  minimum  of  a given  object  function.  The  pur- 
pose of  this  function  is  to  provide  a measure  of  the  consistency  between  the  data  and  the  computed  A \h)  profile. 

Consider  that  the  reflection  heights  ht,  Aj, ...  ,hfj  (and  thus  the  gyrofrequencies  fHt,  fH3, . . . ,fHjy), 
have  been  specified.  The  inverse  method  then  computes  the  corresponding  frequencies  f3,f3, . . . ,f^.  , 

For  a given  estimate  of  N (or  fN)  at  the  current  reflection  height  h-,  a frequency  /■  is  obtained  for  each 
of  the  ionograms  traces  from  the  reflection  conditions  ’• 
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2 


/ 


X trace:  * = 1 - T - /?V3  = f(f-fH)  ->f=(fH  + s/fH3  + 4fN3)/2 

0 trace:  X=\^fN3=f3  ■* /=  fN 

Z trace:  * = 1 + T^/NJ  = /(/  + ///)  - /-  (-/W  + V/W*  + 4/7V!)/2 

Using  these  frequencies,  one  then  computes  from  Eq.  (1),  the  corresponding  virtual  hc;<rht 

.hi 


he  (//>  = r l«ur, 

•'I* . 


0)d/i 


for  each  trace.  The  subscript  c indicates  a computed  value.  The  object  function,  defined  for  example  as 


/(/IV) 


[*i  («)-* ’(/))]' 


x,o,z 


is  minimized  with  respect  to  fN.  Note  that  if  h (fj)  is  visible  for  only  a single  trace,  the  minimum  value  of  J ( fN ) 
is  zero  and  the  method  is  equivalent  to  the  inverse. 

As  an  example,  consider  the  case  where  the  X trace  is  continuously  visible  from  the  satellite  position  but  not 
out  so  far  as  to  display  ground  echoes;  the  visible  0 trace  begins  at  some  nonzero  virtual  height  and  does  exhibit 
ground  echoes;  the  Z trace  is  partially  visible  (fig.  5).  All  visible  data  would  be  scaled  and  curvefit  for  later  interpo 
lation. 


If  the  altitudes  ht , h2 , • • * ,h7  coirespond  to  the 
would  be  determined  in  the  first  lamination  by  the  scaled 
trace  alone,  in  the  third  and  fourth  laminations  by  the  X 


Figure  5.-  Idealized  sample  ionogram  showing  visible  portions 
of  X,  O,  and  Z traces.  The  points  hj  indicate  the  alti- 
tudes at  which  the  corresponding  frequencies  have  reflected. 


data  as  shown  in  figure  5,  then  the  N(h)  profile  (fig.  6) 
X and  Z traces,  in  the  second  lamination  by  the  A" 
and  O traces,  and  in  the  remaining  laminations  by  the  O 


Figure  6.-  Idealized  N(h)  profile  ,is  would  be  computed  by 
the  optimal  lamination  method.  Data  from  the  indicated 
traces  is  used  in  the  computation  of  the  individual  lamina- 
tion boundaries  (see  fig.  5). 
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trace  alone.  Thus,  all  visible  ionogiam  data  impact  the  solution  for  the  N(h)  profile  in  a single,  computer- 
controlled  operation. 

Since  the  choice  of  the  object  function  J is  arbitrary,  various  bits  of  scaled  data  can  be  weighted  according  to 
the  confidence  which  they  inspire.  Also,  a different  object  function  can  be  selected  with  only  minor  changes  to  the 
computer  program. 


THE  HORIZONTAL  GRADIENT  METHOD 


A fundamental  feat  jre  of  ionograms  produced  by  orbiting  swept  frequency  sounders  is  that  each  of  the  trans- 
mitted pulses  traveis  through  a different  ionosphei  s due  to  satellite  motion  within  the  orbital  plane.  The  reduction 
methods  discussed  previously  do  not  take  this  into  account,  but  assume  that  the  ionosphere  is  spherically  stratified. 
This  is  not  generally  trt  e.  The  existence  of  region:  where  the  ionosphere  displays  significant  horizontal  gradients  in 
electron  density  are  wed  Known  [Jackson , 1969b] . 

A conceptually  simple  method  [see  Madsen.  1970b  for  a more  detailed  description]  that  accounts  for  this  hori- 
zontal variation  of  electron  density  can  be  emploj  ed  if  two  or  more  ionograms  are  processed  simultaneously.  The 
basic  computing  block  of  this  method  is  the  inverse  method  above. 


Figure  7.  - Reflection  heights  versus  time.  The  x locations 
are  points  where  electron  density  is  calculated  by  the 
Inverse  Method.  The  o locations  are  points  where 
interpolated  values  of  election  density  would  be  obtained 
in  the  Horizontal  Gradient  Method. 


Consider  a point  in  the  orbital  plane  to  be  defined 
by  the  coordinates  h = reflection  height  and  t = time 
(frequency  could  be  used  in  place  of  time).  For  specified 

reflection  heights  h2,  h3 tyv+l-  the  inverse  method 

computes  N(h)  lamination  profiles  sequentially,  down- 
ward from  the  satellite  position.  Each  successive  calcula- 
tion occurs  at  a later  time  and  requires  as  input,  the  values 
of  N at  all  reflection  heights  in  figure  7 directly  above 
the  one  currently  considered. 

For  example,  the  calculation  of  N(h2,  t2)  requires 
as  input  the  values  N(h2,  t2)  and  N(ht,  t2).  However, 
only  the  approximations  N(h2,  t ,)  and  N{h lP  t0)  are 
known  from  previous  calculations  when  ionograms  are 
processed  singly.  Using  these  approximate  values  in  the 
calculations  is  equivalent  to  assuming  spherical  stratifica- 
tion in  the  ionosphere. 


If  the  horizontal  gradients  in  N(h)  were  known, 
this  variation  with  time  could  be  incorporated  into  the 
calculations  to  give  more  accurate  results.  Specifically,  if  values  of  /V'(b)  = (dMdf)/,  for  h = b,p  h2, . . . , hj  are 
available,  then  in  the  calculation  of  iV(hy+  j ),  one  could  use  the  following  more  accurate  values  of  N at  tj. 


Wl  tj)  = N(hv,  b)  + «j  - b ) N'(h , ) 

Wt.  tj)  = N(h2,  r,)  + (tj  - tx)N‘(h2) 

• • * • • • 

N(hj.  tj)  » N(hj,  tj-i)  + (tj-tj-i)N'(hj) 
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ih'j  horizontal  gradient  method  establishes  values  for  the  derivatives  N'(h)  by  simultaneous  calculation  of 
several  contiguous  ionograms.  The  following  step-by-step  procedure  describes  the  method. 

1 Compute  N(kt , f0)  for  each  ionogram. 

2.  Define  N'(h i ) to  oe  the  slope  of  the  "best”  straight  line  through  *hese  points  when  ;V  is  plotted 
versus  time. 

3.  Using  Eq.  (7),  compute  the  values  N(hx,  tt ) for  each  ionogram. 

4.  Using  the  inverse  method,  compute  N(h2,  (,)  for  each  ionogram. 

5.  Repeat  steps  2 through  4 for  h=h-l,hi 

The  first-order  variations  of  N(h)  in  the  time  direction  are  thus  accommodated. 

Theoretically,  it  is  possible  to  detect  and  measure  horizontal  gradients  within  the  time  (distance)  necessary  to 
prodiXNt  a single  ionogram,  through  the  judicious  use  of  resonances,  electromagnetic  cutoffs,  and  electron  gyrofre- 
quency  resonances.  In  practice,  however,  this  approach  h precluded  by  the  inaccuracies  due  to  resonance  broadening 
and,  in  some  cases,  poor  frequency  resolution  [Colin,  private  communication] . 

One  suspects  then  that  it  might  be  possible,  over  a distance  of  several  ionograms,  to  account  for  more  detailed 
variation  of  N{h ) in  the  horizontal  direction.  This  hypothesis  could  be  tested  by  computing  the  two  parameters 
describing  the  “best”  parabolic  description  of  the  horizontal  variation  in  step  2 above,  and  modifying  Eq.  (7)  accord- 
ingly. A good  parabolic  fit  to  the  data  would  justify  this  appraoch. 


SUMMARY 


The  direct,  field-aligned,  and  inverse  methods  for  ionogram  reduction  operate  reliably  and  as  accurately  as 
possible  on  data  obtained  from  a single  ionogram  trace.  The  inverse  method  is  extended  to  provide  an  N(h)  profile 
consistent  with  all  data  from  a single  ionogram.  This  is  accomplished  without  operator  intervention.  A separate 
extension  of  inverse  provides  greater  accuracy  in  the  computed  JV(/i)  profile.  This  is  accomplished  by  processing 
several  ionograms  simultaneously  so  as  to  account  for  horizontal  gradients  in  electron  density. 

In  the  description  of  the  methods  for  which  signal  propagation  is  vertical,  the  sounder  height  h , has  been 
assumed  constant,  for  simplicity  only,  throughout  the  calculation.  The  direct  and  inverse  methods  as  implemented 
at  Ames  Research  Center  do  account  for  sounder  altitude  variations.  This  capability  can  also  be  incorporated  into 
the  proposed  horizontal  gradient  method  with  little  effort. 

Hie  field-aligned  method  fixes  the  sounder  at  some  initial  position  so  that  the  time-consuming  process  of 
computing  field  line  parameters  is  not  required  for  each  individual  data  point.  It  follows  that  development  of 
inverse  and  horizontal  gradient  analogs  for  field-aligned  signal  propagation  is  not  feasible,  since  they  would  be  even 
more  complicated  and  time  consuming. 
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DISCUSSION 


Parker:  I didn’t  fully  understand  your  method  for  taking  out  the  assumption  of  spherical  layering.  Do  you  need 

many  ionograrrs  to  do  this? 

McCulley:  You  need  at  least  two. 

Wright:  The  solution  to  a succession  of  ionogiams  obtained  along  the  path  of  the  satellite  isn’t  necessarily  going  to 

resolve  the  horizontal  gradient  unless  the  gradient  also  lies  along  the  satellite’s  path. 

McCulley:  These  satellites  are  mostly  in  polar  orbits  where  latitude  is  the  main  change  along  the  satellite  track. 

Croft:  When  you  make  soundings  of  the  ionosphere  using  techniques  that  are  sensitive  to  the  map  of  the  iono- 

sphere, like  HF  ground  backscatter,  you  see  much  evidence  of  traveling  irregularities  that  have  periodic 
structure.  You  would  have  to  be  careful  that  your  sweep  rate  wasn’t  in  synchronism  with  this  period. 
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It  is  well  known  that  the  calculation  of  electron  density  profiles  from  ionogran  s cannot  provide  unique  results, 
since  the  information  required  ' incomplete.  The  anisotropy  of  the  ionosphere  provides  a means  to  reduce  the  un- 
certainty, as  has  been  recognized  by  Argence,  [1954]  and  Storey  [I960] . A systematic  investigation  about  the  inde- 
pendence of  the  two  magnetoionic  components  show  that  the  extent  to  which  the  range  of  ambiguity  can  be  reduced 
depends  mainly  on  the  magnetic  latitude.  On  the  other  hand,  it  is  even  more  important  that  the  numerical  method 
used  for  the  inversion  is  basically  very  accurate.  Studying  the  properties  of  the  integral  transform  we  come  directly 
to  some  conclusions  about  the  minimum  iequirements  for  a numerical  inversion  method.  Some  remarks  are  given 
about  error  estimates  and  first-order  corrections  for  less  elaborate  methods. 


INTRODUCTION 

In  vertical  sounding  of  the  ionosphere  we  obtain  the  virtual  height  h ' as  a function  of  radio  frequency  f.  The 
virtual  height  is  related  to  the  reflection  height  by  an  integral  equation  of  Abel’s  type.  If  this  integral  equation  can 
be  solved  we  obtain  first  the  reflection  height  as  a function  of  the  radio  frequency;  then,  using  the  reflectic  .*  condition 
relating  radio  frequency  and  electron  density  at  the  reflection  level,  we  obtain  the  electron  density  as  a function  of 
height,  the  electron  density  profile.  The  solution  process  of  the  integral  equation  requires  that  the  virtual  height  be 
knowr.  for  all  frequencies  up  to  the  frequency  for  which  the  reflection  height  is  to  be  determined  and  that  the  elec- 
tron density  be  a monotonic  increasing  function  of  height.  While  the  second  condition  may  be  fulfilled  for  a rela- 
tively large  percentage  of  lonograms,  the  first  condition  is  practically  never  fulfilled  [ Manning , 1949] . To  obtain  an 
exact  and  unique  solution,  we  need  additional  information.  Some  additional  information  is  available  as  will  be  shown 
later,  but  usually  .t  s not  sufficient  [Paul  and  Smith,  1968] . We  then  have  to  introduce  a physically  reasonable  model 
assumption  and  try  to  estimate  the  range  of  uncertainty  of  the  profile  obtained  this  way.  Some  insight  into  the  errors 
of  this  process  can  be  obtained  by  studying  the  properties  of  the  integral  transform.  At  the  same  time  we  will  learn 
about  the  requirements  for  a rapidly  convergent  method  of  numerical  solution  [Paul,  1967] . 

Since  the  ionosphere  is  a doubly  refracting  medium,  the  question  arises  <f  and  how  much  additional  information 
can  be  obtained  by  using  both  notarizations  in  the  inversion  process.  The  results  of  an  investigation  of  this  problem 
will  be  presented  later  on. 

We  propose  some  practical  procedures  giving  first-order  corrections,  when  no  additional  information  is  available 
or  where  computing  facilities  are  very  limited. 
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PROPERTIES  OF  THE  INT£GRAL  TRANSFORM 


The  virtual  height  h ’ is  given  by 


h‘  = 


(» 


where  h r is  the  height  of  reflection  and  z is  the  height  variable.  If  we  neglect  the  earth’s  magnetic  Field,  the  group 
refractive  index  is 


ft' 


1 

s/TTy 


(2) 


where  X = fy2  If2,  f is  the  radio  frequency,  ffj  the  plasma  frequency  and  f^2  (»)  is  proportional  to  the  electron 
density.  The  radio  signal  is  reflected  where  X = 1 or  f/^(hr)-f.  If  the  electron  density  is  never  decreasing  with 
increasing  height,  the  virtual  height  can  be  expressed  by 


*'(/)= 


r 


i 


vT^T 


z AX 


(31 


where  hr,  i.,  the  lowei  boundary  - r the  ionosphere  and  z - dz/d ffl2 . 

For  a given  electton  density  profile  Eq.  (3)  represents  an  integral  transform.  Its  most  essential  feature  is  the 
singularity  at  the  reflection  height.  Equation  (3)  can  be  rewritten 


h' 


(4) 


We  see  immediately  that  the  difference  between  virtual  height  and  reflection  height  is  determined  mainly  by  the 
dope  of  the  profile  near  the  reflection  height,  since  the  factor  by  which  z'  is  weighted  in  Eq  (4)  is  very  small  for 
small  values  of  X and  tends  to  become  infinite  when  X approaches  unity.  Similarly . it  is  easy  to  see  that  h'  if 
always  finite  if  z'  is  finite.  On  the  other  hand,  it  is  possible  that  h'  is  f.nite,  even  if  z is  infuvte.  This  wan  be 
demonstrated  in  the  following  example:  Consider  an  electron  density  profile  given  by 


V. 


For  r - 1 this  represents  a parabolic  profile  with  the  maximum  density  proportional  to  fc7  at  the  height  z-hm 
and  a half  thickness  ym-  If  we  solve  Eq.  (5)  for  z and  differentiate,  we  obtain  as  the  expression  for  the  slope 
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1* 
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r 


of  the  profile 


ym  1 

/cJ  2r 


-(2r-l)/2r 


(6) 


If  r>  1/2  the  slope  becomes  vertical  for  ftf-fc,  since 
z'  becomes  infinite.  Introducing  Eq.  (6)  into  (3),  the 
integral  is  convergent  for  /=  fc  if  r < 1 , and  we  obtain 


h\fc ) = hQ  + 


1 

~rym 


1 

for — <r<  1 


(7) 


For  1 12  < r < 1 we  therefore  have  a model  layer  with  a 
maximum  density  for  which  the  virtual  height  at  the  cri- 
tical frequency  is  finite.  The  parabolic  maximum  with 
/ = 1 represents  just  the  limiting  case  between  finite  and 
infinite  virtual  height  as  is  indicated  by  the  logarithmic 
singularity  in  the  expression  for  the  virtual  height  of  a 
parabolic  layer.  The  profile  and  the  virtual  heigh*  for 
r = 2/3  are  shown  in  figure  1.  A few  more  examples  may 
illustrate  how  different  the  effect  of  a vertical  slope  on 
the  virtual  height  can  be.  A model  given  by 


ho 

ho+ym 


(8) 

f/V>fm 


is  continuous  at  the  base  of  tne  layer  where  = fm.  but 

it  has  an  infinite  slope.  The  virtual  height  is  given  by 


h0  0 <f<fm 

ho+ym  f>fm 


(9) 


There  the  vertical  slope  of  the  profile  results  in  a finite  dis- 
continuity of  ti.  This  is  shown  in  figure  2.  The  virtual 
heights  shown  here  are  usually  interpreted  as  reflections 
from  two  flat  thin  layers  a.  different  heights  with  relatively 
low  electron  density  in  between,  which  is  probably  correct, 
but  cannot  be  proven  by  using  ti  observations  alone,  as 
demonstrated  in  this  example. 


Figure  1.-  Electron-density  profile  (labeled  h ) and  virtual- 
heighi  curve  (h')  for  the  model  given  by  2.5  and  2.7. 


Figure  2. ~ Profile  (h)  and  virtual-height  curve  (A),  on 
normalized  scales,  for  the  model  given  by  2.8  and  2.9. 


i 
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For  models  given  by 


again  we  have  a continuous  profile  and  an  infinite  z at  /yy  = fm>  if  *<2.  The  virtual  height,  however,  may  have 
an  infinite  discontinuity  or  a finite  discontinuity,  or  it  may  be  continuous,  depending  on  whether  0 < k < 1 , k = 1 , 
or  1 < k < 2.  Profiles  and  virtual  heights  for  the  three  cases  are  shown  in  figure  3. 

The  examples  so  far  have  shown  that  a continuous  profile  may  have  a continuous  h ' curve  even  if  z ' is  infi- 
nite. or  a singularity  of  z'  may  result  in  a finite  or  infinite  discontinuity  of  ft'.  On  the  other  hand,  if  a finite  or 
infinite  discontinuity  is  observed  in  an  ionogram  it  is  impossible  to  find  out  whether  the  profile  is  continuous  or  not. 


It  is  also  interesting  to  study  the  effect  of  a discontinuity  of  z'.  In  figure  4 we  have  a model  where  the  elect,  on 
density  is  linearly  increasing  with  height,  but  the  slope  .s  discontinuous  at  fj y*  = /, 2 . In  the  upper  portion  the 


Figure  3.-  Model  profiles  given  by  2.10  and  Figure  4.-  Profile  (h)  and  virtual-height  curve  (*  ) 

corresponding  virtual  heights.  for  a model  with  a discontinuous  slope. 
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slope  is  increased  for  fpj1  > /, 1 in  the  lower  portion  the  slope  is  decreased.  The  resjlt  is  an  infinite  slope  of  the 
ti  curve,  with  h'  increasing  vertically  in  the  upper  model  and  decreasing  vertically  in  the  lower  one. 

Cusps  like  the  one  shown  in  the  lower  part  of  figure  4 are  often  observed  in  ionograms.  On  the  other  hand, 
if  the  inversion  of  the  integral  equation  is  based  on  a lamination  method,  the  model  used  should  permit  a continuous 
slope  everywhere  except  where  a cusp  indicates  a discontinuity  or  at  least  a very  fast  change  of  the  slope:  otherwise, 
the  h ' curve  recalculated  from  the  result  will  fit  the  original  virtual  heights  very  poorly.  In  other  words,  at  least  a 
second-order  approximation  should  be  used  within  each  segment  to  allow  for  continuous  slopes  over  the  whole 
profile  to  assure  satisfactory  convergence  to  the  true  shape  of  the  profile.  Furthermore,  it  can  be  shown  that  a finite 
discontinuity  of  the  second  derivative  of  the  profile  will  always  result  in  a continuous  virtual  heights  and  continuous 
slopes  of  h',  which  means  that  in  most  cases  a second -order  model  will  also  be  sufficient  for  good  convergence. 


A SECOND-ORDER  INVERSION  PROCESS 


A second-order  method  can  easily  be  derived  by  performing  a partial  integration  of  (3): 


We  now  divide  the  integration  range  in  several  intervals  and  assume  that  z"  is  constant  within  each  interval 
while  z is  continuous  also  at  the  interval  limits.  If  we  now  define 


*(/i vJ) 


1 


dV 


(12) 


and  gj  = g(fj2),  we  can  write 


hn 


Si  zi~  1 £/-l) 


With  the  assumption  of  a constant  second  derivative,  we  have 


(13) 


(14) 


If  all  the  z-  for  0 < i < n - 1 are  known,  Eq.  (13)  contains  only  one  unknown  z'n  for  which  this  equation  can  be 
solved.  Therefore,  if  the  height  and  the  first  derivative  fo'  = 0 are  known,  the  integral  equation  can  be  inverted 


k 

j 


4-41 


step  by  step.  We  first  obtain  the  derivatives  at  the  interval  limits.  The  second  derivative  is  ’hen  obtained  by  Eq.  (14), 
and  finally  integration  over  the  first  derivatives  gives  the  height 

hn  ~ hn-l  +y(2n  + Zn-l)  (fy2  ~ fn-\)  (15) 

The  same  principle  can  obviously  also  be  used  in  the  general  case  if  the  earth’s  magnetic  field  is  included.  We  only 
have  to  replace  \(\J  \ - X by  u’  of  the  component  used. 

A computer  program  based  on  this  principle  has  been  used  very  successfully  in  the  Boulder  Laboratories  for 
almost  10  years  for  inversion  of  ionograms  to  electron  density  profiles  of  the  ionosphere.  The  method  has  advan- 
tages in  addition  to  the  high  accuracy  obtainable.  The  assumptions  made  can  be  used  consistently  for  interpolation, 
ray  tracing,  and  extrapolation  for  the  maximum  of  a layer,  since  z'  and  z"  are  obtained  explicitly.  There  are,  how- 
ever, two  basic  difficulties  in  the  calculation  of  electron  density  profiles.  First,  we  can  never  obtain  h ' for  very  low 
frequencies  because  of  the  so-called  starting  problem;  second,  the  ionosphere  consists  often  of  several  layers  with 
minima  of  electron  density  in  between,  and  we  observe  discontinuities  or  irregularities  of  h'  on  the  ionograms.  As 
demonstrated  earlier,  there  is  no  way  to  obtain  a unique  solution  if  only  h ' of  one  component  is  used  in  the  inver- 
sion process.  The  joint  use  of  both  components  can  help  to  reduce  the  range  of  uncertainty. 


JOINT  USE  OF  BOTH  COMPONENTS 


Since  the  two  magnetoionic  components,  ordinary  and  extraordinary  ray,  have  different  reflection  conditions 
we  have  to  compare  them  for  corresponding  radio  frequencies,  so  that  both  are  reflected  at  the  same  height.  If  /// 
is  the  electron  gyrofrequency  of  the  earth’s  magnetic  field,  such  a pair  of  frequencies  f0  and  fx  are  related  by 


fx  = jfH  + 


(16) 


In  general,  for  any  given  height  between  the  lower  boundary  of  the  ionosphere  and  the  reflection  height,  the 
group  refractive  indices  are  different  for  those  two  frequencies;  therefore,  we  expect  to  improve  the  results,  where 
uncertainties  are  involved,  by  the  joint  use  of  both  components.  The  expression  for  the  virtual  heights  of  a mono- 
tonic layer  for  the  two  components  are 


h'0=h0 


h'x  = h0 
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+ J 
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rf2  % 

J W Vn% 


tl7) 


(18) 


It  is  convenient  to  change  our  notation  by  introducing  the  7)  asm?  frequency  at  the  reflection  level  by 


i/7rr77^ 


ordinary  component 
extraordinary  component 


(r- 
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and  then 


A 


(20) 


Both  group  refractive  indices  now  have  the  same  singularity  and  can  be  written  as 


, _ M0 


ordinary  component 


(21) 


n'x  ~ 


Mx 


extraordinary  component 


(22) 


where  M0  and  Mx  are  slowly  varying  functions  of  X and  0. 

Abel  gave  an  exact  solution  for  the  type  of  integral  equation  we  have  in  Eq.  (3).  For  M0  - 1 (at  the  magne- 
tic equator)  it  is  given  by 


2 f*n  1 

h(i>n)s—  I “F=r== f*(«)d0  (23) 

n J0  V0n  ~<t> 


We  shall  attempt  to  find  a solution  of  the  general  integral 
Eqs.  ( 1 7)  and  ( 18)  by  modifying  Abel’s  solution  to 


2 f^n  S0 

h(0h)=—  y—yhMd*  (24) 
* J0  Vtn 

and 

**.)-“  I ~7==f  h'xmt>  (25) 


for  orditiary  and  extraordinary  components,  respec- 
tively. The  functions  Sb(0,0n)  and  SjC(0,0n)  can  be 
determined  numerically  without  difficulties.  An  example 
is  given  in  figure  5. 


1.0 


Figure  5.-  Functions  Sn (<>,$„)  and  Sx(<p, <{>„),  solid  lines. 
The  corresponding  function  without  magnetic  field  equals 
the  constant  2/*  shown  as  dashed  line  for  comparison. 


As  mentioned  earlier,  Eqs.  (24)  and  (2S)  are  only  correct  if  the  electron  density  is  not  decreasing  with  increasing 
height;  otherwise,  we  will  have  different  values  for  the  two  integrals  in  (24)  and  (25),  respectively.  If  we  formally 
apply  this  solution  method  and  obtain  different  values,  then  this  difference  is  due  to  a discontinuity  of  the  reflection 
height  if  a singularity  or  discontinuity  of  h'  is  involved,  or  it  has  to  he  attributed  to  low  electron  densities  for  which 
no  observations  were  outainable.  If  we  find  such  a difference  we  know  that  there  is  a “valley”  between  two  layers 
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Figure  6.  Model  ionosphere  with  valley. 


or  there  is  a significant  amount  of  ionization  at  low 
densities,  but  the  question  now  is  whether  the  profile  of 
those  parts  can  be  calculated  from  this  difference  which 
depends  on  frequency. 

Without  going  into  the  mathematical  details  we 
will  present  the  principles  for  studying  this  question.  We 
consider  an  electron  density  distribution  consisting  of 
two  layers  as  shown  in  figure  6.  We  apply  formally  the 
solution  process  to  ordinary  and  extraordinary  compo- 
nents at  corresponding  frequencies  and  take  the  differ- 
ence of  the  results.  We  obtain  a new  integral  equation 
for  the  effective  slope  z'ejy  of  the  unobserved  portion 
of  the  profile. 


where  D is  the  difference  of  the  results  and  K is  the  new 
kern.’!  function,  which  can  be  determined  numerically.  The  effective  slope  is  the  sum  of  the  absolute  values  of  the  two 
(or  more)  slopes  of  the  actual  profile  at  each  plasma  frequency  in  a valley  between  the  two  layers.  The  integral  over 
the  effective  slope  gives  the  correct  height  increment  for  example  h3  - hx  in  figure  6.  A few  examples  of  the  kernel 
function  are  shown  in  figure  7.  The  kerne)  function  depends  strongly  on  the  dip  angle  and  weakly  on  frequency.  A 
first-order  estimate  for  the  height  step  or  ‘valley  width”  can  be  obtained  in  the  following  way.  Since  z'ejr  is  always 
positive  we  can  apply  the  mean  value  theorem  for  integrals 


J, 

| zeff  Vn  --  K/C'is-'h)  (27) 

<t>2  ■ 

Mmn 

where  Kj  is  a value  of  the  kernel  function  somewhere  in  the  interval  between  <>min  and  ^max  The  lowest  plasma 
frequency  in  the  valley  0min  ‘s  unknown.  So  we  see  that  for  small  dip  angles,  if  0min  is  not  very  close  to  $max, 
the  kernel  function  is  almost  constant  and  relatively  good  estimate  for  Kj  and  therefore  for  the  height  step  can  be 
obtained.  This  is  no*  true  for  dip  angles  around  40s  since  the  kernel  function  is  then  very  small.  For  larger  dip  angles, 
if  the  valley  is  not  too  shallow  and  not  too  deep,  r.ga  n a fairly  good  estimate  of  the  height  step  is  possible.  The  uncer- 
tainty of  Kj  increases  again  towards  90°  dip  angle,  and  therefore  the  valley  v/idth  becomes  less  accurate. 

We  may  conclude  that  depending  on  dip  angle  a first-order  estimate  of  the  height  step  may  be  possible,  but  we 
also  realize  that  no  further  details  of  the  shape  of  the  valley  are  obtainable,  at  least  in  practice,  where  the  accuracy  of 
the  virtual  heights  is  limited. 


ERROR  ESTIMATES  AND  FIRST  ORDER  CORRECTIONS 


We  have  seen  that  the  accuracy  of  an  estimated  height  step  depends  mainly  on  the  dip  angle  if  both  components 
are  used  in  the  inversion  process.  In  practice,  there  is  no  chance  to  learn  much  more  abc at  the  distribution  in  a valley 
or  in  the  low-density  portion  of  the  profile.  This  is  partly  due  to  the  limited  accuracy  of  the  virtual  heights,  but  in 
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many  cases  even  more  to  the  inconsistency  of  ordinary  and  extraordinary  components  if  the  ionosphere  is  not  exactly 
horizontally  stratified.  Even  if  such  a height  step  can  be  obtained  accurately  the  retardation  of  the  signal  within  this 
height  interval  has  to  be  known  for  an  exact  calculation  of  the  remainder  of  the  profile.  On  the  other  hand,  it  is  in 
the  nature  of  the  inversion  process  that  errors  are  reduced  relatively  fast  with  increasing  frequency.  This  means,  for 
example,  that  for  a first-order  estimate  of  the  “width"  of  a valley  between  E and  F regions  and  a reasonable  model 
for  electron  density  distribution  in  the  valley,  the  error  in  height  for  densities  near  the  maximum  of  the  F region 
will  be  in  the  order  of  1 km,  even  if  the  height  step  for  the  valley  is  wrong  by  10  km,  provided  that  the  maximum 
density  of  ihe  F region  is  at  least  3 or  4 times  the  maximum  density  in  the  E region. 

More  insight  in  the  error-reducing  process  can  be  obtained  in  the  following  way  : We  assume  the  profile  is  cal- 
culated up  to  a height  /ic(0n-l),  the  index  c indicating  that  it  is  only  approximately  correct.  The  virtual  height 
/r<0M)  is  now  used  to  calculate  the  portion  of  the  profile  between  0n-i  and  <?> j.  The  resulting  height  h(<j>n)  is 
again  incorrect.  If  an  index  e indicates  the  exact  profile,  we  have 


h\<t>„)  = \ !xz'e  dZ/p2 


(28) 


On  the  other  hand,  we  use  the  equation 


h'(%)  = h0  + 


+ 


MZC  d fN 


7 


(29) 


to  determine  the  height  interval  between  4>n-\  and  4>n-  If  we  take  the  difference  between  Eqs.  (28)  and  (29),  we 
obtain 


»' (z'e-z'cWN2 


+ 


nXz'e-z'cWN’ 


= 0 


(30) 


If  we  now  assume  that  the  difference  between  the  correct  and  incorrect  slopes  ts  always  positive  or  always  negative, 
the  mean  value  theorem  can  be  applied  and  we  obtain 


(31) 


or 


M*n)  - M^)=(i-^7-)[m^-i)-M^-i)]  (32) 

where  m\  is  some  value  of  m in  the  lower  interval  and  m'u  correspondingly  in  the  upper 

interval  From  Eqs.  (21)  and  (22)  we  know  that  /if  < m'u  an(f  since  m > 1 we  see  that  the 

height  error  at  0(J  is  less  than  the  error  at 
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Ah{<t>n)<  AA(0m-i) 


(33) 


0<  1 --^f  < 1 (34) 

Mu 

This  property  of  the  inversion  process  is  very  important,  especially  if  limited  computing  facilities  exclude  the  joint 
use  of  both  components  for  electron  density  calculations.  In  this  case  reasonable  assumptions  must  be  made  about 
the  undetected  portions  of  the  profiles.  While  the  use  of  both  components  permits  an  error  estimate  based  on  a 
least-squares  fit  of  the  recalculated  virtual  heights  to  the  observed  virtual  heights  of  both  components,  no  error 
estimate  can  be  obtained  if  one  component  is  used  together  with  some  assumptions.  On  the  other  hand,  since  we 
know  that  the  errors  are  decreasing  with  increasing  frequency , we  can  expect  that  the  profile  calculated  will  come  close 
to  the  true  profile  at  least  near  a maximum  of  a layer,  often  the  most  important  portion  - for  example,  for  a world- 
wide study  of  the  ionosphere  - or  its  time  behavior  at  a fixed  location. 

It  is  standard  practice  to  extrapolate  h'  linearly  to  zero  frequency  for  the  starting  process  if  the  slope  of  the 
virtual  height  is  positive  at  the  beginning  of  the  ionogram.  A valley  correction  of  about  the  same  sophistication  can 
be  applied  easily  using  the  following  rule  for  the  input  data  hj  if  n0fjS  are  the  observed  data 


hi  “ h0bs 


f<fcF  and  0</of 


h'i  ~ 2h'obs~h'mnE  f**fcE  and  4>>foE 


hi  ~ h0f,s 


f>fcE  and  0>/oE 


which  simply  means  that  for  the  calculation  of  the  F layer  we  double  the  virtual  path  within  the  E layer  corres- 
ponding to  the  assumption  that  the  E layer  is  symmetrical,  which  can  be  roughly  justified  by  the  physics  of  layer 
formation  in  the  ionosphere. 

A similar  rule  can  be  applied  to  nighttime  ionograms  if  the  virtual  heights  are  decreasing  with  increasing  fre- 
quency at  the  beginning  of  an  ionogram.  In  this  case,  we  know  that  there  is  another  layer  below  the  observed  one 
with  a critical  frequency  less  than  the  frequency  where  the  first  echo  is  seen.  Sometimes  it  is  possible  to  extrapo- 
late for  this  critical  frequency,  and  the  retardation  in  this  lower  layer  can  be  estimated  by  simple  assumptions  such 
as  a constant  slope  in  the  F region  and  a parabolic  profile  of  the  E region.  If  such  an  extrapolation  of  the  critical 
frequency  is  not  practical  we  may  instead  use 

hi  ~ /"'•Anin 

(38 

h(  = hj  f^fmn 

where  h'm  is  the  minimum  virtual  height  at  the  frequency  fm.  Equation  (38)  surely  is  harder  to  justify  than  Eqs. 
(35)  through  (37),  but  it  assures  at  least  that  the  correction  is  in  the  right  direction. 

Summarizing  we  see  that  the  accuracy  of  profile  calculations  is  always  limited.  Sometimes  it  is  not  even  pos- 
sible to  obtain  a reliable  error  estimate  in  the  sense  that  the  errors  of  the  results  should  bear  a simple  relation  to  the 
estimated  errors  of  the  virtual  heights.  This  holds  at  least  for  the  valley  and  starting  problem.  There,  depending 


mainly  on  the  dip  angle,  the  joint  use  of  both  components  can  help  to  reduce  the  errors  significantly.  The 
error  is  decreasing  with  increasing  frequency,  but  it  also  depends  very  much  on  the  structure  of  the  ionosphere. 
Nevertheless,  in  many  cases  the  results  in  the  vicinity  of  a maximum  of  a layer  are  more  accurate  than  the 
virtual  heights,  due  to  the  error  reduction  inherent  in  the  process  as  can  be  demonstrated  by  model  studies. 
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DISCUSSION 


Reinisch:  Is  the  programming  required  for  routine  work  already  done?  if  so  do  you  now  use  it? 

Paul:  This  is  just  a study  and  not  in  routine  use. 

Wright:  You  are  asking  if  this  Abel  generalization  has  been  used  in  a working  method  to  deduce  valleys.  The 

N(h)  formulation  that  Dr.  Paul  has  devised  is  in  use  to  obtain  profiles.  Another  process  that  makes  use 
of  the  ordinary  and  extraordinary  information  is  in  routine  use  to  resolve  the  valley  ambiguities  so  far  as 
permits.  To  perform  this  solution,  some  assumptions  are  needed  about  the  valley  parameters. 

Reinisch:  Do  you  have  any  figures  on  the  frequency  of  occurrence  of  the  valleys  and  its  average  depth? 

Paul:  We  cannot  detect  that.  You  can  do  something  like  Jackson  did.  Your  results  depend  on  the  model. 
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ABSTRACT 

A method  is  described  to  accomplish  automatic  data  selection  and  profile  inversion  to  obtain  ionospheric 
electron  density  profiles  from  digitized  radio  soundings.  The  profile  inversion  is  based  on  the  well-established 
formulation  of  Paul  [9960]  by  which  the  optimum  radio  frequency  sounding  intervals  can  be  specified  from  an 
approximate  knowledge  of  the  profile;  the  expected  virtual  height  coordinates  (h')  at  these  frequencies  (/)  are 
estimated,  and  procedures  are  then  used  to  select  h '(/)  observations  nearest  the  predicted  coordinates  from  a 
subsequent  digital  ionogram.  From  these  the  next  profile  is  obtained.  The  process  adaptively  follows  the  changing 
shape  and  detail  of  the  profile.  The  procedure  requires  an  average  of  IS  sec  per  profile  on  a standard  data  processing 
computer,  and  can  be  adapted,  with  benefit  to  online  real-time  use  in  our  digital  ionosonde. 
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N7  3-'l 


INTRODUCTION 


Long  before  the  age  of  digital  computers,  the  panoramic  ionosonde  had  established  the  swept-frequency 
ionogram  as  a convenient  and  subjectively  informative  representation  of  ionospheric  structure.  To  a considerable 
extent  phenomena  in  the  ionosphere  arc  recognized,  described,  and  named  - if  not  necessarily  unders'^od  - by 
their  ionogram  manifestations.  These  qualitative  and  pictorial  applications  sufficiently  justify  systems  designed  to 
make  such  recordings,  but  the  greater  usefulness  of  quantitative  information  has  led  to  progressively  more  elaborate 
schemes  for  their  analysis.  Radio  echoes  from  virtually  the  entire  near  side  of  the  ionosphere  are  recorded  on  the 
program;  the  inversion  problem,  and  most  of  the  formidable  difficulties  of  deducing  the  profile,  has  been  recognized 
from  the  beginning.  But  in  part  because  it  was  relatively  easy  to  “scale”  simple  parameters  in  great  quantity  from 
ionograms,  and  also  because  profile  inversion  involves  awesome  arithmetic,  only  scattered  efforts  to  deduce  profiles 
were  attempted  until  electronic  computers  emerged  to  assume  the  principal  burden.  Even  so,  20  to  SO  virtual 
height-frequency  readings  are  requit  ed  as  input  information  to  the  inversion  process;  when  this  scaling  is  done  by 
hand  from  film  ir.iogram  recordings,  the  labor  remains  excessive  and  accuracy  compromised;  yet  the  task  requires 
experienced  judgment,  since  n.u  ~h  tf  the  echo  information  of  the  ionograms  in  irrelevant  to  the  profile. 

The  next  stage  in  the  evolution  of  these  techniques  is  currently  under  way:  the  application  of  digital  hardware, 
which  we  believe  provides  an  approach  to  the  solution  of  the  remaining  problems.  Small,  fast . general  purpose  digi- 
tal computers  are  now  available , at  a fraction  of  the  cost  of  inflexible  special-purpose,  fixed-wire  control  systems. 
When  interfaced  to  the  essential  elements  of  the  ionosonde  (transmitter,  receiver,  frequency-synthesizer)  the  general- 
purpose  computers  assume  all  ci  the  functions  of  system  control,  echo  recognition,  parameter  digitization,  and  pre- 
liminary or  even  final  data  processing.  This  is  the  orientation  of  our  program  of  digital  ionosonde  (Dynasonde) 
development  ( Wright,  1969] , and  it  consists  of  three  interdependent  areas  of  activity: 

1 . Development  of  the  Dynasonde  system  itself,  including  the  selection  of  digital  and  analog  system 
components,  interfac4*  \,  and  the  refinement  of  control  software  for  the  Dynasonde  online  general- 
purpose  computer. 
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2.  Development  of  the  Kinesonde,  an  independent  and  more  narrowly  oriented  hardware  system  m which  we 
concentrate  on  the  measurement  and  statistical  analysis  of  ionospheric  radio  echo  parameters  as  time  series, 
for  studies  of  Ionospheric  motions  and  microstructure;  these  additional  measurement  capabilities  aie  then 
achievable  directly  in  the  Dynasonde  system  through  straightforward  software  design 

3.  Development  o'  data  analysis  methods  to  accomplish  specific  measurement  objectives  such  as  echo  location 
(angle  ot  irrival)  and  ionospheric  “drifts”  from  spaced-antenna  measurements,  or  the  extraction  of  iono- 
spheric electron  density  profiles  from  digitized  group-path  versus  frequency  measuiements.  Although  our 
goal  in  each  case  is  to  accomplish  these  analyses  online  and  in  realtime  (within  the  Dynasonde).  it  is  expe- 
dient to  approach  their  development  oifline,  using  tape  recorded  digital  data  and  a large,  general-purpose, 
easily  progiamrned  central  computer. 

The  paper  reports  our  progress  toward  the  specific  objective  of  determining  ionospheric  electron  density  profiles 
automatically  - that  is,  without  the  need  for  manual  data  scaling  s - subjective  echo  pattern  recognition.  Measure- 
ments for  this  purpose  are  being  made  by  the  prototype  Dynasorbe,  the  main  features  Oi  which  have  been  described 
recently  [ Wnght,  1969) , and  are  not  repeated  here,  except  to  illustrate  pictorially  the  digital  lonograms  themselves. 
Our  approach  to  automatic  profiles  is  heavily  influenced  by  Paul’s  [1960]  illuminaong  formulation  of  the  profile 
inversion  problem  ar.d  developed  into  our  present  general-purpose  working  system  bv  Paul  (1967) , Ho'vc  and 
McKinnis  (1967), and  Wright  [1967).  In  the  section  we  outline  this  formulation,  emphasizing  some  of  its  propei- 
ties  especially  appropriate  to  our  present  oojective.  We  also  describe  the  “AutoA(/i)”  process,  which  consists  of  a 
group  of  computer  subroutines  each  of  clearly  definable  structure,  purpose,  and  interrelationship.  Some  results  of 
the  AutoJV(ft)  system  are  illustiated. 


FORMULATION  OF  THE  PROFILE  INVERSION  PROBLEM 


The  vertical  component  of  the  group  path  of  a vertically  incident  radio  wave  pulse  in  a plane-stratified  iono- 
sphere determines  one  observable  parameter  of  the  pulse  - its  total  travel  time.  One  naif  of  this,  multiplied  by  the 
speed  of  light,  gives  the  apparent  or  ‘virtual”  height  h'  for  the  center  frequency  / of  the  radio  wave  pulse.  Thus, 
the  group  speed  varies  along  the  path  z to  the  height  of  reflection  h,  and  the  resulting  virtual  heigh;  varies  with 
probing  frequency  as 


*'(/) 


n\f,z)dz 


(1) 


Here  the  gr  up  refractive  index  is'  is  the  ratio  of  the  speed  of  light  *.o  the  group  speed,  and  depends  on  the  frequency 
of  the  rad.o  wave,  the  direction  0 of  propagation  with  respect  to  the  earth’s  magnetic  field,  a i the  properties  of  the 
ionospheric  medium,  electron  density  and  magnetic  field  strength,  expressed  through  the  pli><ma  frequency  f\  and 
electron  gyrofrequency  ffj.  In  our  problem  ///  and  0 are  const  mi  for  a given  location,  and  fjy  is  assumed  to  vary 
with  height  (monotonically)  and  time  only.  Although  U is  double  valued,  resulting  in  two  values  c!  for  total 
reflection  of  / (identified  as  f0  and  fx,  respectively),  we  discuss  mainly  the  analy  sis  of  h'(f0)  to  obtain  z(/\). 
The  observations  also  provide  h'(fx),  which  may  be  analyzed  independently  to  give  z(/]y);  in  principle,  the  results 
should  not  depend  on  the  choice  of  o or  x,  but  in  practice  the  assumptions  of  monotonicity  and  of  only  horizon- 
tal stratification  (and  sometimes  profile  errors)  cause  inconsistencies  when  h'(f0‘  results  are  compared. 

In  part,  the  o and  x information  can  be  reconciled  to  advantage  in  allowing  a fi,  m < rmu  correction  for  nonmono- 
tonicity >i  the  profile  [Pauland  Wright,  1963;  Howe  and  McKinnis,  1967)  ,but  »•••  A »o  N(h)  development  defers 
this  problem  and  employs  h'(fQ ) alone. 
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The  inversion  of  Eq.  (1)  to  obtain  z(fa)  cannot  be  expressed  in  closed  form  because  of  the  complexity  of  p'. 
Many  step-by  step  formulations  have  been  advocated  (reviewed  by  Wright  and  Smith , 1967 ] , but  only  Paul  [1960] 
explicitly  applies  the  basic  mathematical-physical  requirements  of  the  problem  to  justify  a workable  numerical  process. 
Approaching  reflection,  where  fa(z)  ~ fo-P  tepds  to  infinity,  so  that  a step-by-step  process  for  the  inversion  of 
Eq.  (1)  based  on  slabs  of  unknown  thickness  Az  is  impossible.  The  unknown  height  is  therefore  removed  from  the 
integral  limit,  and  the  integration  is  obtained  over  the  ^dependent  variable  fa,  or  any  monotonic  differentiable 
function  of  it  <f>(fa),  by  the  transformation 


•*’(/)  = hQ  + 


i; 

Co 


iiifa) 

m,N) 


(2) 


where  h0  and  0O  refer  to  the  bottom  of  the  ionosphere. 

To  simplify  the  notation,  and  to  represent  the  true  height  and  the  retardation  sepaiately,  we  may  rewrite 
Eq.  (2)  as 


<*(/) 

h‘  = h r * (p  - l)z’  d0 


(3) 


Many  methods  of  inversion  now  proceed  by  assuming  that  zj  = (dz/d0),  = const  o.er  short  (not  necessarily  uniform) 
intervals  of  0,  whereupon  the  integral  in  Eq.  (3)  can  be  replaced  by  a summation  and  the  resulting  system  of  equations 
solved  for  the  several  unknowns  z , given  'n  equal  number  of  h' . There  are  two  defects  of  this  procedure,  however. 
The  weight  function  p’  remains  infinite  at  the  reflection  level,  sc  that  the  usual  assumption  - that  the  cons*  nt  z, 
represent  mean ' lues  at'nbutable  to  the  middle  of  their  corcespon.Ung  intervals  - is  systematically  wrong.  , as  de- 
fect mi<tfit  be  ameliorated,  as  Paul  [1960,1971]  lus  proposed,  by  specifically  assuming  mat  the  value  of  z obtained 
refers  to  the  reflection  level,  rather  *vin  to  the  interval  midponV.  However,  a more  basic  problem  exists  since  iono- 
mms  show  us  very  clearly  iliat  z is  not  discontinuous,  except  perhaps  approximately  so  at  a small  number  of  well- 
marked  “cusps”  of  the  h (/)  observations.  To  conform  with  the  observations,  it  is  essential  that  the  rc.:!;-"ity  of 
z be  prc-”ved  by  the  inversion  process  except  where  the  observations  indicate  discontinuity.  IfEo  (3)  is  integrated 
by  parts  and  z is  assumed  differentiable,  we  obtain 

r r<Xfa)  l*<f) 

*'(/)=*(/)+  |z'(/)  j \p\f.fi)  - i]d0(/i)  -J  Mifa)zXN)  J 1] 

L Co  ->0o  to  *o  ... 

(4) 

Note  that  the  retards  ;on  A now  expressed  by  two  terms  in  which  the  (variable)  first  and  second  derivative  3 along  the 
profile  are  weighted  by  a quantity  (p'-I  )d0,  which  is  everywhere  finite.  If  the  integration  of  the  term  involvin’ 
z"  ~ d*.?/d02  is  no  replaced  by  a sum  over  t intervals  in  each  of  which  z'{  is  assumed  constant,  we  obtain  a sys- 
tem of  equations  relating  discrete  values  of  the  observable  quantity  hj  to  discrete  alues  of  the  continuous  profile 
derivatives  z\  at  the  reflection  heights  h,-.  In  our  application  of  these  ideas,  we  take  0 = log  fa1 , and  define  the 
integration  variable  t = log/jy2  - log/2 . The  system  of  equations  relating  h\  and  the  profile  may  then  be  written 


h'i  = j2/- 1 Sji + */- 1 1 *p)]  (5) 
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The  quantities  g and  G depend  on  the  plasma  frequencies  used  to  divide  the  profile  and  on  the  radio  frequency, 
but  not  on  the  profile  itself.  A simple  table  of  g and  G would  suffice  for  all  profiles  if  the  intervals  were  to  be 
specified  arbitrarily  jr.d  kept  invariable.  However,  it  must  be  remembered  that  Eq.  (5)  is  an  “exact”  relation  be 
tween  the  profile  and  h'  provided  the  z-  are  constant  over  successive  intervals.  The  relation  is  thus  a good  approxi- 
mation if  the  change  in  z"  is  sufficiently  small  between  successive  intervals;  and  as  the  form  of  the  profile  varies 
from  one  occasion  to  the  next,  the  optimum  interval  widths  (hence  the  optimum  measurement  frequencies  for  ti) 
will  likewise  vary 

The  essential  principles  of  our  strategy  for  automatic  N(h)  calculations  can  now  be  stated  as  follows: 

1 . Corresponding  to  i given  change  of  z"  along  the  profile,  there  is  a maximum  permissible  interval  width 
Aiog/jy2  within  which  z"  is  approximated  by  a constant  value,  such  that  the  height  error  A z does  not 
exceed  s permitted  maximum.  In  other  words,  the  profile,  once  known,  can  be  partitioned  along  the  scale 
of  plasma  frequencies  to  determine  optimum  intervals  at  which  radio  frequencies  should  be  chosen  to 
obtaii  virtual  heights  from  which  to  determine  the  profile. 

2 Hie  apparently  circular  ar^un.c.it  of  (}'•  becomes  useful  when  two  other  principles  are  realized: 

(a)  Optimum  intervals  may  be  determined  in  an  iterative  fashion  if  an  approximate  profile  is  known  or 
can  be  assumed. 

fb)  in  a sufficiently  continuous  time-seiies  of  ionograms,  the  profile  from  one  ionogram  serves  as  an 
approximation  from  which  to  determine  the  optimum  intervals  for  measurement  of  the  next 
ionogram. 


THE  AUTOiV(h)  PROCESS 


The  system  consists  of  a group  of  subroutines 
prograr  -'cu  (at  present)  in  FORTRAN  IV  for  use  on  a 
large  centra)  computer.  The  observational  data  are  tape- 
recorded  sequences  of  digital  ionograms  obtained  by  the 
Dynasonde.  The  ultimate  aim  of  adapting  the  method  to 
online  real-time  use  vjithin  the  Dynasonde  has  been  kept 
in  view,  and  is  commented  on  later.  Figure  1 is  a block 
diagram  giving  the  principal  subroutine  names  and  inter- 
relationships. Let  us  break  into  the  loop  following  a 
PROFILE  calculation  and  trace  the  subsequent  logical 
operations: 

1 . Subroutine  GETF  uses  the  result  of  the 
PROFILE  calculati'n  to  determine  optimum 
sounding  frequency  intervals  such  that  the 
height  interval  error  caused  by  assuming 
z"  = const  in  each  frequency  interval  will  not 
exceed  a certain  chosen  limit. 


Figure  1.-  Block  diagram  of  t£<  principal  subroutines  of  the 
Auto N(h)  system  showing  briefly  their  functions  and 
interrelationships. 
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2.  For  the  newly  specified  set  of  sounding  frequencies,  and  using  the  given  true-height  profile,  subroutine 
VFROMH  calculates  the  expected  virtual  heights.  This  is  done  by  a forward  calculation  using  Eq.  (5). 

3.  Equipped  now  with  optimized  frequencies  and  their  expected  virtual  height,  F1ND1T  examines  the  next 
digital  ionogram  in  the  sequence  at  these  h'(f  ) coordinates  to  se'.ect  accurate  values  of  the  nearest  h\f) 
data. 

4.  These  values  are  tiansferred  K the  PROFILE  subroutine,  which  solves  Eq.  (5)  in  the  backward  direction 
for  the  wanted  profile. 

The  operation  of  these  subroutine.,  is  of  course  somewhat  more  complicated  than  this  simple  description  might 
suggest.  PROFILE  and  VFROMH  involve  basically  deterministic  arithmetic  treatments  of  Eq.  (5)  in  opposite  “direc- 
tions”, and  each,  given  the  other’s  output,  would  produce  the  other’s  input  identically. 

Similarly,  GETF  is  deterministic:  If  we  consider  the  Taylor  series  expansion  of  the  profile  at  a given  point  f, 
and  for  z'(tj)  write  zy: 

Mr)  = hj  + zj(t-  r;)  + 1 zj  (t  - r;)J+-  - • r,  <t<i/+1  (7) 


The  profile  is  found  using  z"  = const  in  each  interval,  but  this  yields  a succession  of  values  of  z"  that  can  be  used  to 
estimate  the  amount  by  which  z"  is  • or  constant  along  the  profile.  Thus,  the  difference  in  z"  between  successive 
intervals  gives  an  estimate  of  zj” 


= 2 


H -z/-i 
ri+l-fi-t 


(8) 


as  well  as  an  estimate  of  the  error  resulting  from  the  use  of  constant  z". 

If  the  most  important  part  of  the  error  arises  in  the  first  term  dropped  from  the  power  series,  it  is 


■ ■1 


Thus  an  allowed  frequency  step  can  be  chosen  proportional  to  (e/z”’) 1/3  to  generate  an  optimized  set  of  frequencies. 

To  a limited  extent,  the  error  permitted  in  each  interval  automatically  controls  both  the  number  of  intervals 
used  to  divide  the  profile  and  the  resulting  accuracy  of  the  profile,  and  this  “control”  would  probably  work  smoothly 
and  monotonically  with  the  permitted  error  if  the  ionogram  (and  the  profile  itself)  were  “smooth”.  In  fact,  however, 
a wide  spectrum  of  profile  fluctuations  exists;  it  might  be  feared  that  a small  pcimitted  error  would  lead  to  a “runaway” 
condition  m which  small  detail  of  the  h\f)  curve  - or  the  digitization  noise  itself  - would  demand  its  own  cluster  of 
closely  spaced  intervals.  Fortunately,  several  factors  work  against  this  tendency.  One,  under  the  user’s  control,  is  a 
frequency-height  window  to  v described,  wnich  smooths  over  h\f)  detail  of  the  order  of  its  own  size.  Perhaps  more 
important,  the  GETF  proces.  in.->  little  tendency  to  add  detail  that  is  not  already  present  in  the  preceding  profile  infor- 
mation. In  this  sense,  the  permitted  c-ror  must  be  interpreted  as  an  average  value,  appropriate  to  a smooth  curve  over 
the  interval,  whatever  detailed  variates  take  place  within  it. 

The  selective  effect  of  a given  permitted  error  in  determining  the  degree  of  resolution  of  the  ionogram  and  profile 
is  illustrated  in  figure  2 which  shows  the  dependence  (for  a particular  sequence  of  ionograms)  of  the  number  of 
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“scaling”  intervals  on  alternate  choices  of  the  permitted 
error  e.  Results  for  two  values  of  e are  snown,  and  for 
each  e,  two  initial  sets  of  scaling  intervals  were  chosen, 
comprising  5 and  34  intervals,  respectively.  As  shown, 
about  10  intervals  suffice  for  the  ionogram  sequence  if 
e = 50,  and  about  25  intervals  are  required  if  e = 10. 

The  number  of  intervals  tends  toward  these  partitioning 
values,  whatever  the  number  of  intervals  in  the  initicl  set. 
This  also  illustrates  that  the  Autoj^fh ) process  can  adap- 
tively follow  smooth,  natural  changes  in  the  amount  of 
structure  of  the  profile,  and  that  the  accuracy  of  the  auto- 
matic process  is  not  necessarily  constrained  by  the  amount 
of  detail  included  in  the  initial  scaling. 

Figure  3 represents  a digital  ionogram  from  the 
Dynasonde  tape  recordings  graphed  by  a computei- 
drive  microfilm  plotter  with  scale  proportions  chosen  to 
simulate  a conventional  analog  ionogram.  In  the  upper 
part  of  figure  3 two  sets  of  scaling  intervals  have  been 
plotted  on  the  ionogram.  The  arrows  indicate  the  fre- 
quencies of  the  sparse  im‘ial  “scaling”,  and  the  O' s are 
the  optimized  intervals  as  calculated  in  GETF  (e  = 10) 
and  plotted  at  the  ti  coordinates  calculated  by 
VFROMH.  Because  the  original  scaling  does  not  ade- 
quately describe  the  k\f)  curve,  the  virtual  heights 
calculated  for  the  optimize  frequency  set  do  not  fall 
on  he  ti{f)  curve  at  all  points,  particularly  near  f0E. 
Subroutine  FINDIT  is  given  the  task  of  reconciling  such 
predicted  ti(  f ) values  with  the  nev*  ionogram  of  the 
series. 

Subroutine  FINDIT  is  the  principal  link  between 
the  digital  ionogram  and  the  profile.  It  is  here  that  the 
Auto  N(h)  process  adapts  itself  to  the  time-varying 
ionosphere,  that  echoes  relevant  to  the  profile  are  distin- 
guished from  those  that  are  not,  and  that  moderate 
smoothing  of  the  h\f)  data  is  permitted.  FINDIT  is 
provided  with  a sequence  of  predicted  (h',f)  coordi- 
nates, and  has  the  task  of  finding  values  acceptably  near 
these  coordinates  from  the  array  of  digitized  ti(f)  data 
of  the  ionogram.  Tire  initial  ti  values  are  usually  calcu- 
lated from  the  previous  profile  by  VFROMH  at  frequencies 
specified  from  this  profile  by  GFTF.  Alternatively, 
FINDIT  may  be  given  an  approximate  set  of  initial  h',f 
coordinates  scaled  “by  eye”;  they  could  easily  be  speci- 
fied without  “scaling”  at  digital  display  by  use  of  a light 
pen.  These  alternatives  provide  ways  of  starting  the  entire 
process. 

Whatever  the  source  of  the  initial  ti,  f set,  FINDIT 
establishes  a “window”  of  controlled  width  in  /(=  Fw) 
and  in  ti(=  Hw)  centered  on  the  given  h',f  coordinates. 


Figure  2.-  Number  of  intervals  vs.  ionogram  sequence  number 
for  two  values  of  the  error  parameter  6,  showing  initial 
number  of  intervals  at  ionogram  number  0,  and  for  iono- 
gram numbers  1 - 7 the  numb  -r  of  optimized  intervals 
determined  by  GETF. 


MHz 


DIGITAL  IONOGRAM,  BOULDER 
2 OCTOBER  1970  , 0850-05  MST 

Figure  i - (a)  Ionogram  number  1 showing  the  initial 
sparse  'scaling’  inlet  - j larrcvs)  and  the  optimized 
intervals  (O’-)  as  calculated  by  GETF.  (b)  The  echo 
positions  at  optimized  intervals  from  GETF,  as  found 
by  FINDIT,  using  h'  information  from  the  ionogram. 
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Figure  4.-  (a)  Full  digital  ionogram.  (b)  Area  within  the 
box  in  (a)  enlarged  ten  times  hi  height,  (c)  Area  within 
the  box  of  (b)  again  enlarged  ten  times  in  height,  shew- 
ing the  correction  of  the  estimated  h'  to  the  median  h' 
in  the  window  and  of  f to  the  frequency  coordinate  of 
the  median  height,  in  four  successive  steps.  Note  that 
the  third  and  final  steps  position  the  window  center 
exactly  on  measured  data  heights. 


FINDIT  first  determines  the  number  of  ti  values  within 
the  window,  the  median  ti  of  these  values,  their  quartile 
range,  and  the  frequency  cooidinate  of  the  median  ti . 

If  the  median  ti  differs  by  less  than  a preset  value  from 
the  initial  value,  the  median  (and  its  frequency  coordinate) 
are  retained  as  wanted  values  for  the  subsequent  PROFILE 
calculation.  Otherwise,  the  height  coordinate  (only)  for 
the  window  center  is  moved  to  that  median  value,  and  a 
new  median  and  other  quantities  are  found  for  this  new 
position  of  the  window.  This  process  is  repeated  a maxi- 
mum permitted  number  of  times  or  until  the  change  'in 
ti  only)  from  one  iteration  to  the  next  is  less  than  the 
preset  tolerance. 

The  arbitrary  parameters  of  this  process  are  obvious, 
and  can  be  chosen  within  wide  limits  without  significantly 
affecting  the  resulting  profile.  Note  that  the  actual  fre- 
quency coordinate  of  the  selected  echo  is  reported,  but 
that  the  window  is  not  permitted  to  move  ir.  the  frequency 
direction.  This  assures  that  the  frequency  intervals  remain 
near  (within  Fh-  of)  the  optimum  intervals  determined 
by  GETF. 

The  lower  part  of  figure  3 illustrates  the  echo  selec- 
tion results  of  FINDIT,  subsequent  to  the  GETF  and 
VFROMH  operations  that  led  to  the  upper  part  of  figuie 
3.  The  window  mot.-m  operations  have  successfully 
located  the  E region  echoes  near  the  wanted  frequencies. 

Figure  4 shows  a typical  sequence  of  the  window 
operations.  The  lower  part  (a)  is  again  a computer-driven 
microfilm  plot  of  all  of  the  ti,f  coordinates  digitized  by 
the  Dynasonde.  The  other  parts  of  the  figure  were  also 
plotted  by  the  computer,  and  they  zoom  to  successively 
smaller  portions  of  the  h\f)  array.  From  GETF,  an 
h',f  coordinate  has  been  “predicted”  at  a position  within 
the  “box  outlined  on  the  computer-plotted  full  ionogram, 
and  in  the  middle  part  of  the  figure  only  this  region  has 
been  replotted,  full  scale  (b).  Neither  the  outlined 
region  in  pa;t  (a)  nor  that  in  pa  t (b)  represents  the 
window  described  above;  parts  (a)  and  (b)  are  intended 
simply  to  emphasize  the  level  of  ionog'am  resolution 
(in  ti  and  /)  at  which  the  FINDIT  process  is  opeiative. 
In  the  upper  part  (c)  of  the  figure  the  initial  center  and 
dashed  outline  of  the  window  is  shown.  The  0.2-km 
digitization  resolution  of  ti  and  the  uniform  frequency 
spacing  i f the  ionograms  usually  programmed  in  the 
Dynasonde  are  clearly  evident  at  this  magnification  of 
the  ionognm.  The  initial  ti  (f)  position  does  not 
happen  to  lie  on  an  echo,  but  this  is  rectified  in  the  first 
step,  where  the  frequency  coordinate  is  set  to  that  of 
the  median  ti  within  the  window.  The  window  is 
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then  centered  (in  h only)  on  that  median  h , and  the  process  is  repeated.  In  this  example,  the  final  window 
position  is  shown  by  the  continuous  outline. 

FINDIT  has  special  provisions  for  expanding  the  window  should  the  first  look  contain  fewer  than  an  expected 


number  of  echoes;  decisively  empty  windows  are  simply 
dropped  from  further  consideration.  This  is  appropriate, 
for  example,  for  interval  limits  lying  near  a decieasmg 
f0F 2.  Conversely,  if  the  highest  frequency  interval 
limit  seems  too  low  in  frequency  (according  to  an  empi- 
rical criterion  that  involves  an  extrapolated  critical  fre- 
quency), then  more  points  are  added  at  t>e  end  and 
their  virtual  heights  are  estimated  (V'ROMH)  using  the 
z and  z"  values  in  the  last  previously  existing  interval. 
Any  such  “feeler  point”  that  FINDIT  can  validate  by 
exampling  the  ionogram  is  retained  with  its  coordinates 
adjusted  to  the  observed  values.  Together,  these  pro- 
visions follow  increases  or  decreases  of  the  peak  density. 

The  next  step  in  this  loop  is  the  PROFILE  calcula- 
tion using  the  data  obtained  by  FINDIT  from  the  new 
ionogram.  The  PROFILE  subroutine  is  virtually  identical 
to  the  program  described  by  Howe  and  McKinnis  [1967] . 
Coefficients  [Eq.  (6)]  for  each  iterval  are  calculated  as 
needed;  a parabolic  peak  is  determined  by  extrapolating 
Hz’,  and  a topside  extrapolation  can  be  performed  if 
desired.  The  principal  description  of  the  profile  is  in 
terms  of  the  height  and  z values  at  the  several  discrete 
values  of  log,0/}y,  or  their  equivalent  values  of  Nel cm3; 
both  parameters  are  tabulated  as  in  table  1 . The  cumula- 
tive total  electron  content  is  also  evaluated.  Since  z is 
continuous  between  tabulated  intervals,  a second-order 
interpolation  can  be  fierformed  for  Ne(or  log,0//y)  at 
wanted  fixed  heights.  A subroutine  is  available  which 
calculates  Ne  at  each  10-km  level  and  provides  punched 
cards  with  this  information. 


EXAMPLES  OF  THE  AUTOW(/i)  PROCESS 


As  discussed  above,  the  permitted  error  e and  the 
window  size  influence  the  amount  of  detail  the  AutoN(/i) 
process  will  recognize  in  a sequence  of  profile  calculations. 
Figures  S and  6 are  examples  in  which  the  process  follows 
fine  and  gross  structure. 

Figure  5 shows  four  ff-layer  height  profile?  corres- 
ponding to  the  first,  fifth,  tenth,  and  fifteenth  profiles  in 
a series  spanning  approximately  an  hour.  These  profiles 
were  calculated  with  e = 2 m and  involved  about  20  in- 
tervals to  describe  the  fine  detail  of  the  f-region  h'(f) 
information.  The  initial  profile  (1002  MST)  shows 


Figure  5.-- £-iayer  Auto N(h)  profiles  showing  small,  but  sig- 
nificant height  and  stricture  changes  over  a one  hour  period. 


Figure  6.-  Electron  density  variations  at  fixed  altitudes  from 
a 5-1/2-hour  period  calculated  by  the  Auto N(h)  process 
from  an  initial  manual  scaling  at  1205  MST. 
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TABLE  1.- Tabulated  ,V(6l  profile  produced  by  the  AutoA'(A)  process.  The  first  column  lists  values  of  log10/]v  representing  the  optimized 
frequence  inters  jIs  found  by  GETF  from  the  previous  profile.  The  next  column  (H*)  lists  the  observed  h'  from  the  digital  tonogram ; 
values  of  h'  are  digitized  to  0. 2 km  resolution,  and  used  i this  accuracy  in  the  profile  calculation,  but  are  rounded  in  printing  here  to  10  km 
The  word  EXTRAP  signifies  a provision  to  extrapolate  for  underlying  ionization,  not  employed  here.  Columns  H,  Z*  and  Z**  represent  the 
calculated  real  heights  and  the  first  and  second  height  denvatives.  The  cumulative  integrated  electron  content  and  the  electron  density  equiva- 
lent to  log/yy  complete  the  profile  information.  The  F-layer  peak  extrapolation  is  listed  on  the  bottom  line.  The  quantity  S represents  the 
quarter-thickness  of  a parabola  continuous  (in  Z*)  with  the  profile  below. 
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inflections  (corresponding  to  cusps  on  the  h'  curve)  at  log10/]y  ~ 0.43  and  ~ 0.53.  About  20  min  later  the 
inflection  at  ~ 0.43  has  disappeared,  the  inflection  at  ~ 0.53  appears  at  a lower  height,  and  a new  inflection  has 
appeared  at  ~ 0.57.  The  same  structure  is  present  at  1045,  but  at  slightly  lower  heights.  At  1105  the  orofile  structure 
is  slightly  higher  than  the  previous  profile,  and  the  inflections  are  less  pronounced.  Careful  comparison  of  the  profiles 
with  the  ionograms,  and  with  the  other  profile  calculations  of  the  sequence,  shows  that  there  are  significant,  if  small, 
changes  of  height  and  structure  in  the  E region.  The  AutoAW  process  can  easily  follow  evident  height  variations 
in  density  to  better  than  0.1 -km  resolution. 

Operational  conflicts  for  the  Dyn«sonde  make  it  difficult  at  present  to  obtain  satisfactorily  k,ng  sequences  of 
digitized  ionograms  with  which  to  demonstrate  the  performance  of  the  AutoMh)  process  in  following  complete 
diurnal  variations.  A partial  example  of  this  kind,  itowever,  is  shown  in  figure  6.  The  calculated  electron  density  time 
variations  at  a number  of  fixed  altitudes  on  21  June  1971  are  plotted  over  the  period  1205-1735  MST  at  5-min  inter- 
vals. The  sequence  (of  67  profiles,  requiring  15  min  computation  time)  was  obtained  entirely  automatically  after 
initiation  by  a rough  manual  scaling  ol  the  first  (1205)  ionogram,  which  was  computer-plotted  for  this  purpose  in  the 
format  of  figure  4(a).  The  hourly  ionograms  also  were  plotted,  and  the  scaled  values  of  fpF2  (converted  to  electron 
density)  are  included  on  figure  6.  After  1730  the  h'(f)  curve  became  unrecognizable  because  of  multiple  sporadic 
E reflections.  Short-period  changes  that  occur  simultaneously  and  similarly  at. all  altitudes  (e.g.,  at  1540)  are  probably 
introduced  by  the  AutoAl(h)  process  itself,  by  faulty  following  of  the  h'(f ) information.  However,  the  main  features 
of  this  part  of  the  diurnal  variation  are  well  represented  and  most  of  the  indicated  structural  variations  are  significant. 
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FUTURE  DEVELOPMENTS 


The  process  described  here  is  not  in  a final  form,  but  it  works  sufficiently  well  that  it  is  already  of  practical 
utility.  The  main  problems  arise  when  very  rapid  changes  or  genuine  echo  ambiguities  occur;  olten  a human 
analyst  would  then  also  not  be  sufficiently  informed  by  the  usual  ionogram  to  resolve  the  ambiguities.  The  addi- 
tional problem  with  an  automatic  N(h)  process  is  that,  once  derailed,  it  is  not  certain  to  get  once  again  on  the  track 
of  the  true  profile,  in  practice,  however,  our  process  does  realize  appreciable  stability  against  such  derailments,  and 
the  way  to  further  improvement  is  clear.  Even  within  the  h‘{f)  information  alone,  the  computer  can  rapidly  per- 
form consistency  tests  - for  example,  between  o and  x,  among  multiple  echoes,  or  between  successive  ionograms 
forward  and  backward  in  time  - that  are  tedious  and  relatively  ineffective  when  attempted  subjectively. 

The  AutoN(h)  process  provides  a necessary  link  between  a digitized  data-acquism  on  system  (here,  the 
Dynasonde)  and  the  data  user.  Since  the  acquisition  of  fully  digitized  ionograms  (fig.  4)  is  already  an  automatic, 
efficient,  and  reliable  process,  the  N(h)  analysis  could  be  performed  centrally  from  data  supplied  by  remote  sounders. 
If  tape  recordings  are  used,  the  quantity  of  tape  required  remains  finite  '.ince  it  can  be  reused.  Alternatively,  the  entire 
system  is  compatible  with  centralized  ionosoude  control  and  data  processing  via  any  data  link  of  moderate  capacity. 

The  AutoN(h)  concepts  can  provide  entirely  new  capabilities  to  the  Dynasonde  when  they  are  included  as  a 
part  of  its  own  real-time  software.  Since  only  a very  small  fraction  of  the  echo  information  of  an  icnogram  is  required 
for  a profile,  and  since  our  process  determines  both  the  optimum  sounding  frequencies  and  the  expected  echo  arrival 
times,  the  usual  ionospheric  sounding  process  can  be  tremendously  compre.  sed.  While  perhaps  2000  ieal  echoes  from 
1 500  discrete  frequencies  are  contained  in  the  average  ionogram  (requiring  4 times  this  number  of  pulse  transmissions 
at  200/sec  in  our  real-time  echo  recognition  process),  only  about  20  pieces  of  h\f)  infoimation  are  required  for  the 
profile.  When  the  optimum  frequencies  are  known  with  their  expected  echo  arrival  times,  only  1%  to  3%  of  the  nor- 
mal ionogram  sounding  activity  is  required.  The  savings  can  be  realized  in  many  otherwise  unattainable  advantages, 
including 

1 . Greatly  reduced  radio  interference . The  necessary  h'(  f ) information  car.  be  acquired  at  a random  pulse 
emission  schedule,  or  in  a burst  of  only  0.1  sec  duration,  and  the  frequencies  are  already  tangibly  “rando- 
mized” by  the  GETF  conditions. 

2.  Improved  time  resolution.  The  N{h)  conversion  represents  an  amount  of  arithmetic  requiring  less  time 

on  a fast  minicomputer  than  the  echo  pulse  travel  time;  thus,  the  profiles  can  be  produced  at  virtually 
any  desiied  rate  within  this  natural  limitation.  < 

3.  Acquisition  and  use  of  additional  information.  Among  many  examples,  of  particular  relevance  is  the  use 
of  echo  angle-of-arrival  information  to  discriminate  ovtihead  structure,  to  improve  Auto  N(k ) echo  selec- 
tion, 3nd  to  describe  tilts  and  horizontal  structure.  The  Dynasonde  accommodates  an  addressable  receiving 
antenna  array  and  digitizes  the  complex  amplitude  of  each  echo  at  each  antenna  to  provide  such  information 
as  part  of  its  Kinesonde  functions. 

In  the  larger  context  of  Dynasonde  applications,  automatic  N(h)  profiles  become  considerably  more  than  a 
desirable  data  output  of  the  system;  they  provide  essential  reference  information  for  adaptive  application  of  the 
Dynasonde  toward  most  other  measurement  objectives. 
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DISCUSSION 


G rossi:  How  do  you  distinguish  between  the  ordinary  and  extraordinary  returns  when  they  cross? 

Wright:  If  you  know  the  profile  you  can  calculate  both  curves,  and  the  FINDIT  process  can  be  set  to  the  task  of 

fin  ling  the  information  nearest  these.  This  leads  then  to  the  identification.  If  you  know  the  profile,  based  on 
previous  results,  you  can  predict  this.  Note  that  the  pr  cess  can  run  forward  or  backward.  Also,  the  angle-of- 
arrival  feature  is  very  important  here. 

Reinisch:  With  regard  to  frequency  selection,  are  you  sure  of  convergence  in  the  loop  so  you  get  reasonable  fre- 

quencies? What  do  you  do  if  they  are  unreasonable? 

Wright:  It  is  not  a fully  refined  process  yet.  It  is  workable,  however,  as  a research  tool  now. 

Grussi:  Have  you  ever  computed  how  much  time  you  have  to  wait  until  the  cross-correlation  coefficient  of  the 

lonogram  drops  to  ! 'e? 

IVrig/it:  You  are  asking  about  the  autocorrelation  time  of  the  electron  density  at  a fixed  height  and  location.  No, 

this  has  not  been  done,  mainly  because  of  a lack  of  adequate  time  resolution  and  statistics  in  the  past.  The 
Auto  N(h)  process  is  a w„y  to  gain  these  statistics,  when  implemented  on  a digital  lonosonde. 
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IONOSPHERIC  PROFILE  INVERSION  USING 


OBLIQUE-INCIDENCE  IONOGRAMS 
D.  L.  Nielson  and  T.  M.  Watt 
Stanford  Research  Institute,  Menlo  Park,  California 

N73-11607  ABSTRACT 

This  paper  reviews  some  of  the  elementary  methods  used  in  deriving  true-height  profiles  from  oblique-incidence 
ionosgrams.  The  two  principal  methods  presented  are  oblique-to-vertical  transformation  and  direct  inversion  of  the 
oblique-incidence  ionogram.  Limitations  in  oblique-incidence  inversion  due  to  magnetic-field  effects,  horizontal 
gradients,  and  absolute  time  delay  are  discussed. 


INTRODUCTION 


In  comparison  with  the  amount  of  attention  given  electron-density  profile  measurement  it  vertical  incidence, 
use  of  the  equivalent  technique  at  oblique  incidence  has  been  almost  totally  ignored.  This  avoidance  is  attributable 
in  part  to  the  lack  of  facilities  with  which  to  perform  the  measurement,  but  more  to  the  limitations  and  inaccuracies 
encountered.  While,  in  principle,  the  measurement  of  any  oblique  wave  can  reveal  something  about  the  intervening 
ionosphere,  in  practice  many  effects  limit  the  accuracy  of  what  can  be  inferred  from  that  measurement. 

It  is  useful  to  define  several  concepts  that  are  addressed  in  this  paper.  There  are  two  sounding  geometries  - 
vertical  and  oblique.  There  are  two  possible  operational  modes  - monostatic  and  bistatic.  All  sounding  referred 
to  in  this  paper  takes  place  in  the  MF  and  HF  bands  of  the  frequency  spectrum  (~  1 MHz  to  30  MHz). 

Sounding  at  vertical  incidence  involves  propagating  a signal  vertically  and  receiving  the  reflected  signal  at  the 
same  ground  station.  Most  soundings  at  these  frequencies  involve  only  linearly  polarized  waves,  and  a common  an- 
tenna is  used  for  both  transmitting  and  receiving;  this  operating  mode  we  refer  to  as  monostatic.  In  certain  special 
cases  such  as  the  use  of  circularly  polarized  waves,  it  would  be  necessary  to  use  separate  antennas  for  transmitting 
and  receiving;  this  operating  mode  could  be  called  bistatic. 

Sounding  at  obliqu^ incidence  involves  transmitting  a signal  such  that  the  wave  does  not  reflect  back  directly 
to  the  transmitter  site  but  rather  propagates  obliquely  through  the  ionosphere  and  returns  to  the  earth  at  a remote 
location.  Monostatic  operation  at  oblique  incidence  is  possible  using  ground  backscatter.  In  ground  backscatter, 
the  signal  propagates  via  the  ionosphere  and  is  scattered  on  reaching  the  ground.  A small  fraction  of  the  energy 
returns  to  the  transmitter/ receiver  site.  A somewhat  similar  setup  in  a bistatic  mode  would  result  from  using  a 
tiansmitter  and  receiver  at  one  site  and  a transponder  at  a remote  site.  A transponder  located  at  a remote  site 
receives,  amplifies,  and  retransmits  the  signal  back  to  the  sounding  site.  The  advantages  of  a transponder  are  that  it 
provides  an  amplified  return  signal  and  that  its  location  can  be  precisely  determined.  7'he  most  common  oblique 
sounding  mode  is  bistatic,  using  a transmitter  at  one  location  and  a receiver  at  a remote  locatK  i. 

The  reason  most  often  cited  for  oblique  sounding  for  profile  measurement  is  the  greater  accessibility  to  re  iote 
areas  that  it  provides.  The  use  of  most  oblique-incidence  sounding,  however,  has  been  closely  related  to  operational 
communication  needs,  and  the  area  of  profile  inversion  has  been  of  no  direct  interest  to  the  users. 


* 

* 


Whi’e  ground  backscatter  can  be  successfully  employed  in  estimating  single  parameters  such  as  critical 
frequency,  its  use  in  profile  measurement  is  severely  limited.  Problems  such  as  measurement  of  elevation  angle,  the 
relatively  poor  knowledge  of  the  location  of  ground  and  ionospheric  reflection  points,  and  the  extensive  ionospheric 
araa  involved  in  typical  sweep-frequency  operation  have  prev:nted  its  use  for  this  purpose. 

The  instrumentation  for  oblique-incidence  pulse  soundings  is  not  unlike  that  for  vertical-incidence  sounding.  In 
the  case  of  monostatic  backscatter  soundings,  the  difference  could  be  merely  in  the  antenna;  however,  other  signal- 
processing measures  can  be  taken  to  improve  the  quality  of  the  records.  The  most  satisfactory  records  for  profile 
measurement  at  oblique  incidence  are  termed  oblique  tonograms.  These  records  of  pulse-group  delay  are  the  result 
of  opeiating  a separated  sweep-  or  step-frequency  'ransmitter/receiver  pair  synchronized  in  time  (and  consequently 
in  frequency).  Although  oblique  sounding  was  originally  limited  to  am  pulse  waveforms  with  center  frequency 
swept  over  the  range  of  propagating  frequencies  [Agy  et  al .,  1959;  Moller , 1964;  Wilkens,  1960]  .synthesized  step- 
frequency  pulse  [Nielson,  1966] , sweep  frequency  CW  (chirp)  [Fenwick  and  Barry,  1966] , and  extremely  wideband 
high-power  pulse,  have  moie  recently  been  used  with  equipment  developed  at  Stanford  Research  Institute.  Except 
for  transponder  systems,  all  oblique-sound  g systems  require  stable  oscillators  to  maintain  timing  accuracy,  \bsolute 
delay  is  therefore  not  normally  available. 


Ignoring  the  birefringence  of  the  ionosphere,  the  lowest  order  ray  at  vertical  incidence  is  limited  to  a single  ray 
at  a single  frequency.  Most  ionospheric  profiles,  on  the  other  hand,  give  rise  at  oblique  incidence  to  a high  and  low 
ray  at  frequencies  above  the  vertical-incidence  critical  frequency.  Figure  1 illustrates  the  relationship  between  group 
delays  at  vertical  and  oblique  incidence.  One  curve  can  be  considered  a transformation  of  the  order.  Under  certain 
assumptions,  the  transformation  is  very  simple  and  bilateral;  without  such  assumptions,  it  may  be  neither.  In  this 
papei  we  discuss  these  transformations  and  their  attendant  assumptions  in  some  detail. 


Consider  the  problem  of  inverting  an  oblique  ionogram  such  as  that  shown  in  figure  1,  under  the  assumptions 
of  a spherical  earth  and  a concentric,  isotropic  ionosphere.  Using  the  nomenclature  of  Croft  and  Hoogasian  [196o] , 
as  illustrated  iri  figure  2,  we  have 
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Figure  1.-  Correspondence  between  vertical  and 
oblique-incidence  ionogran'i 


Figure  2.-  Nomenclature  for  an  oblique  ray  path. 
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rn  cos  (3  = r0Ho  cos  ft,  = x0  cos  ft, 


(3) 


where  p is  the  refractive  index  at  radial  distance  r.  The  quantities  D and  P'  are  known  or  measured  values,  and 
Eq.  (3)  is  a form  of  Snell’s  law  (Bouger’s  rule).  For  the  conditions  considered,  Eqs.  ( 1 ) through  (3)  arc  exact.  From 
Eq.  (3)  we  obtain  the  condition  satisfied  at  apogee  for  any  particular  frequency: 


rt!i{rt)  = r0  cos  ft, 


For  an  isotropic  and  collisionless  medium,  /a  is  defined  by 


S<r)*  1 - 


/*»('> 

f2 


(4) 


(5) 


where  /yy  is  the  plasma  frequency  of  the  medium,  / is  sounding  frequency,  and  ft \2  is  proportional  to  the  electron 
density. 

The  problem,  then,  is  to  determine  the  dependence  of  fyy  on  r that  satisfies  Eqs.  ( i ) and  (2)  for  all  sounding 
frequencies.  We  next  discuss  the  capabilities  and  limitations  of  the  various  methods  of  establishing  that  relationship. 


TRANSFORMATION  BETWEEN  OBLIQUE  AND  VERTICAL  INCIDENCE 


We  consider  here  the  direct  transformation  between  corresponding  oblique-  and  vertical-incidence  ionograms, 
without  explicitly  obtaining  an  electron-density  profile  solution  - that  is,  from  a P\f)  ionogram.  we  deduce  an 
h'v(fv)  ionogram. 

Equations  (1)  and  (2)  are  not,  in  general,  analytically  soluble,  and  tractable  procedure;.  can  be  employed  only 
by  making  certain  approximations.  We  consider  in  this  section  approximations  that  can  be  used  under  appropriate 
circumstances. 


Flat-Earth  Flat-Ionosphere  Approximation 

The  flat-earth  flat-ionosphere  approximation  is  most  easily  reached  by  assuming  that  the  radii  r0  and  rt  are 
much  larger  than  any  other  dimension.  In  this  case  we  obtain  r - r0  = h,  rtlr0  -*■  1,  so  that  Eqs.  (1)  through  (4) 
become 


rnt  da 

0-2co,ft,J  ^_cos!fey<. 

(6) 

_ Ckt  ih 

2 Jo  - cos2  ft,)l/2 

(7) 

p cos  (3  = cos  ft, 

(8) 

~ cos  (3  - 

(9) 
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In  the  flat-earth  flat-ionosphere  approximation,  a simple  equivalence  exists  betwee"  an  oblique  ionogram  and 
the  corresponding  vertical  ionogram.  For  any  oblique  sounding  frequency  f,  w*  can  find  the  equivalent  vertical- 
sounding  frequency,  fv,  that  corresponds  to  reflection  from  the  same  height,  h,.  Comparing  Eqs.  (S)  and  (9),  we 
observe  that,  at  the  altitude  of  reflection  = /)  r,  and 

cos2  ft,  = 1 00) 


fNr  ~ f sin  Po 


Since,  for  vertical  incidence,  f^r  - fv,  then 


fvsf  sin  ft, 


The  virtual  height  corresponding  to  fv  is  found  directly  from  Martyn  s (19C5]  theorem  and  is  exactly  correct  for  a 
fiat  earth  and  a fiat  ionosphere: 


h'v~h  =T  ii!) = T tan No- 


where h'v  is  the  virtual  heigiit  associated  with  a vertical  sounding  at  fv,  and  ti  is  the  heigtit  of  the  equivalent  trian- 
gle associated  with  an  oblique  sounding. 

Comparison  of  Eqs.  (6)  and  (7)  yields  an  explicit  solution  for  cos  ft?  in  terms  of  observed  values: 

D 

ros  (‘4> 


Combining  Eqs.  (12),  (13),  and  (14),  we  obtain 


f id?t 

j 

1(f)’  -o'|" 


where,  for  multiple-hop  paths,  P'  and  D are  per-hop  values.  For  the  flat-earth  flat-ionosphere  approximation,  then, 
an  oblique  ionogram  can  be  reduced  to  a vertical  ionogram  without  introducing  any  a priori  assumptions  regarding 
the  ionospheric  vertical  profile.  The  integral  equation 


h'v(fv) 
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must,  of  course,  still  be  inverted  to  solve  for  the  electron- 
density  profile. 


Curved-Earth  Flat-Ionospheie  Approximation 

The  curved-earth  flat-ionosphere  approximation  is 
reasonably  valid  where  propagation  with'.n  the  ionosphere 
does  not  take  place  over  a large  horizontal  distance.  For 
this  condition,  modified  forms  of  the  flat-earth  flat- 
ionosphere  equations  can  be  used. 

For  a gi  aund  range  D as  shown  in  figure  3,  the 
straight-line  distance  between  the  end  points  is  Ds , 
where 

£>j  = 2r0sin—  (18) 

2 r0 


/ 


and  the  height  above  the  chord  at  the  midpoint  of  D is 


(19) 


Defining  the  argle  a0  to  be  the  are  e between  the  virtual-pain  ray  P '12  ard  the  plane  Ds.  we  can  write 
equations  analogous  to  Eqs.  (6).  (7).  (15),  and  (16)  as 


Thus,  using  Eqs.  (18),  (19),  (22),  and  (23).  we  obtain,  for  the  curved-er.rth  flat  iono.phue  case, 


2 r0  sin  (Dl2r0  V \u2 

: — 7 J | 


(20) 


(21) 


(22) 


(23) 


(.24) 
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(25) 


* 


*i-*‘  *i  h’ ' (2,°  s*'0],,'r‘>(1 ' c“  it) 

where,  as  before,  D und  P'  are  per-hop  values.  It  can  be  seen  that  Eq.  (24)  reduces  to  Eq.  ( 1 S),  and  Eq.  (25)  reduces 
to  IX  (16),  as  P,r0  becomes  very  small. 


Curved-Earth  Curved-Ionosphere 

Exact  , » tionships . ,.e  curved-earth  curved-ionosphere  case  is  that  described  by  Eqs.  (1)  through  (4);  for 

this  case,  tl.cie  is  no  tractable,  analytical  transformation  from  oblique  to  vertical  incidence  without  an  a priori  know- 
ledge of  tht  •*  cetroa-density  profile. 

In  attempting  to  transform  from  oblique  to  vertical  incidence,  the  reasoning  proceeds  along  this  line:  Assume 
that  for  a particular  concentric  ionosphere  of  unknown  profile  an  oblique  measurement  is  made  of  group  path  P\ 
atfre  ,^ncy  /.  over  a distance  D.  Although  P'  may  be  multivalued,  a single  value  of  F'  will  correspond  to  only 
cue  ray  propagated  between  the  end  points.  The  ray  has  associated  with  it  some  apogee,  or  maximum  real  height  of 
propagation  into  the  ionosphere.  A group-delay  transformation  to  vertical  incidence,  then,  requires  the  answers  to 
two  questions:  At  vertical  incidence,  what  sounding  frequency  would  penetrate  to  the  same  height  as  did  the 
obliquely  propagated  ray?  At  vertical  incidence  at  such  an  equivalent  sounding  frequency,  what  would  be  ‘V  expec- 
ted v?.iue  of  virtual  height? 

A partial  answer  to  the  first  question  can  be  given  immediately.  The  equivalent  vertical-incidence  frequency  is 
equal  to  the  ionospheric  plasma  frequency  at  the  ray  apogee.  If  ht  is  the  apogee  height,  then 


fv  ~ 


(26) 


Therefore,  from  Eq.  (5),  we  have 


<2?) 

where  the  refractive  index  p is  dependent  on  the  sounding  frequency  f and  the  plasma  frequency  at  ht.  By  using 
Eq.  (4)  and  noting  that  ht  and  rt  refer  to  the  same  height,  we  can  express  fv  in  terms  of  elevation  angle: 


Equation  (28)  is  exact,  under  the  stated  conditions,  but  from  this  point  we  cannot  proceed  further  without  employing 
approximations. 

In  the  plat-earth  flat-ionosphere  approximation  the  problem  expressed  in  Eq.  (28)  was  solved  by  letting 
r0/rt  approach  unity.  In  this  way  we  not  only  eliminated  the  effect  of  the  unknown  rt,  but  we  also  made  the 
integrands  in  Eqs.  (1)  and  (2)  identical,  so  that  cos  fa  could  be  found  as  equal  to  the  ratio  of  the  measured 
quantities  D and  P'  without  our  having  to  solve  either  of  the  integral  equations  (1)  and  (2).  In  the  curved-earth 
flat-ionosphere  approximation,  we  did  essentially  the  same  thing,  since  the  difference  between  the  two  approxima- 
tions is  described  completely  by  the  trigonometric  relationships  of  Eqs.  (18)  and  (19). 
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The  second  question  is  concerned  with  determining  the  virtual  height  h'v  associated  with  a vertical  sounding 
at  frequency  fv.  Under  previous  assumptions  the  vertical  and  oblique  virtual  heights  were  easily  related  by  an 
equivalent  triangular  path.  But,  for  the  curved-earth  curved-ionosphere  case,  it  can  be  seen  from  Eqs.  ( 1 ) and  ( 2 ) 
that  P and  D (or  Ds)  can  be  related  only  through  a knowledge  of  the  electron-density  profile.  Thus,  the  dif- 
ficulty has  to  do  with  the  existence  of  a right  triangle  with  a base  Dj/2  and  a hypotenuse  P'l 2.  Consider  figure  3 
The  takeoff  angle  a,,  could,  in  principle,  be  measured  for  any  particular  sounding  situation,  and  the  three  measured 
quantities  would  then  be  interrelated  according  to 


A 

COS  Oo  = — 


(291 


so  the  height  hs  could  be  expressed  as 


hs 


P'  Ds 

= — sm  aL>  = tan  aL, 


(30) 


It  can  be  shown,  in  fact,  that  for  a curved  ionosphere  of  finite  thickness.  Eq.  (29)  is  not  in  general  true,  since  a triangle 
that  relates  the  measurable  quantities  cannot  be  constructed.  Consequently,  the  concepts  of  virtual  height  and  equi- 
valent virtual  path  lose  heir  meaning. 

Although  a prof  Je-independent  transformation  is  not  available,  some  authors  [Sm/rh,  1939:  Kobayashi.  1971 J 
have  attempted  to  use  flat-ionosphere  expressions  to  compensate  for  the  curved  ionosphere  through  the  introduction 
of  correction  factc.s.  All  such  correction  factors,  however,  must  make  some  assumptions  about  the  profile  itself  as 
wtll  as  the  angle  of  incidence  onto  the  ionosphere. 

Approximation  errors.  It  has  already  been  stated  that,  for  the  general  case,  an  oblique-to-vertical  transforma- 
tion cannot  be  made  without  knowledge  of  the  ionospheric  layer.  In  an  attempt  to  assess  the  seriousness  of  the 
Hat- ionosphere  approximations,  a particular  model  ionosphere  has  been  assumed.  The  model  is  a quasiparabolic  ( QP) 
layer  defined  oy 


with  parameter  values  /tr.tvx  = 10  MHz,  rm  = 6670  km,  rj>  = 6S45  km,  and  ym  = 125  km.  For  the  particular  values 
chosen,  it  is  intended  to  approximate  a typical  day  time  F layer.  Ionograms  were  calculated  from  the  layer  by  using 
a ray-tracing  program  [Paul  etal.,  1968] . 

The  approximation  errors  associated  with  the  flat-ionosphere  approximations  (fig.  4)  are  obtained  as  follows: 

By  using  the  model  layer  and  the  ray -tracing  program,  ionograms  were  generated  for  vertical  incidence  and  for  ground 
ranges  of  ! 000,  2000,  and  3000  km.  Neglecting  small  errors  in  the  numerical  ray  tracing,  the  vertical-incidence  >ono- 
gram  was  taken  to  U exactly  correct.  Next,  for  each  of  the  three  oblique  ionograms,  equivalent  vertical-incidence 
ionograms  were  cal.uUted,  according  to  the  flat-earth  flat-ionosphere  (Eqs.  (24)  and  (25)]  approximations.  Figure 
4(a)  illustrates  the  two  ap^-oximate  and  the  one  exact  vertical  incidence  ionograms  for  the  1000-km  case.  Figures 
4(b)  and  (c)  illustrates  t!  „-  san  e comparison  for  ground  ranges  of  2000  and  3000  km,  respectively. 


Figure  4 - Typical  errors  resulting  from  various 
oblique-to-verticai  tran-formations. 


The  usefulness  of  the  approximations  depends,  of  course 
here  are  only  illustrative. 


In  an  attemp'  to  correct  the  frequency  error 
introduced  by  ionospheric  curvature,  the  Newbern  Smith 
correction  factor  | .Danes.  1965]  was  introduced  on  the 
longer  paths.  The  points  in  figure  4 indicate  the  result, 
which  in  this  instance  reduced  the  error  to  within  a few 
percent. 

The  comparisons  ll'ustrateG  in  figure  4 lead  to  the 
following  genera!  conclusions 

! . T*  c approximations  are  best  3t  the  shorter 
giour.d  ranges  and  become  progressively 
poorer  as  range  increases. 

7 Neglect  of  ionospheric  curvature  leads  to  a 
derived  election-density  profile  that  has  a 
higher  critical  frequency  than  the  true  pro- 
file. The  error  can  largely  be  corrected,  but 
correction  is  a complicated  function  of  the 
orofile  ; rd  path  length,  and  a simple  factor 
cannot  be  expected  to  work  in  all  instances. 

3.  At  all  ranges,  the  curved-earth  flat-ionosphere 
approximation  is  much  better  than  the  flat- 
earth  flat-ionosphere  approximation. 

4.  The  flat-earth  flat-ionosphere  approximation 
leads  to  a derived  electron  density  with  alti- 
tude higher  than  that  of  the  actual  layer.  For 
this  particular  model  layer,  the  height  error  is 
about  20  km  at  lOOC-km  range,  60  km  at 
2000-km  range,  and  1 10  km  at  3000-km  range. 

on  the  needs  of  the  user,  and  the  particular  results  obtained 


DIRECT  INVERSION  OF  THE  OBLIQUE-INCIDENCE  EQUATION 
FOR  GROUP  DELAY 


Lamination  Methods 

In  spite  of  the  additional  degree  of  freedom  characteristic  of  oblique  incidence,  inversion  methods  have  been 
developed  that  are  not  unlike  most  of  the  approaches  used  at  vertical  incidence.  That  is,  the  icnogram  can  be  inverted 
by  using  numerical  methods  without  first  converting  to  vertical  incidence.  Two  of  these  numerical  methods  use  lami- 
nations of  electi  n density  across  which  a particular  form  is  assumed  for  the  variation  of  electron  density,  a form  that 
will  allow  a closed-form  solution  for  the  virtual  delay  and  the  ground  distance  [Georye,  1970;  Smith,  1970] . These 
quantities  are  then  summed  across  all  the  laminaticns,  for  testing  against  known  or  measured  values  of  D and  P' . In 
calculating  the  plasma  frequency  for  a given  lamination  (height),  the  value  at  the  rsxt  lowest  interval  is  used.  This  pro- 
cedure is  common  to  both  the  lamhrr  methods,  as  well  as  for  vertical  incidence.  Such  a procedure  relies,  of  course,  on 
a lack  of  horizontal  gradients,  since  oblique  incidence,  over  a fixed  distance,  ays  at  different  frequencies  follow 
different  paths. 
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Of  the  two  laminar  methods,  the  more  general  is  that  of  Smith  [1970] . In  this  method  the  takeoff  angle  p0 
and  the  height  of  reflection  rt  are  juggled  until  P‘  and  the  ground  distance  D are  satisfied.  To  make  the  search 
efficient,  limits  on  the  takeoff  angle  are  calculated.  The  variation  with  electron  density  across  a slab  or  lamination  is 
given  by  a hyperbolic  relation. 


fN*  = A - — (32) 

r 

which  allows  a closed  form  for  ground  range  subtended  by  the  distance  the  ray  is  in  the  slab.  The  values  of  A and  B 
are  such  that  fpj1  is  approximately  linear  in  r over  small  increments  in  r. 

Neglecting  horizontal  gradients  and  the  magnetic  field,  the  reflection  height  is  given  from  Eqs.  (4)  and  (S)  as 


rQ  cos  ft, 

[1  ~ fN\rt)ir\v' 


(33) 


where  r0  is  the  radius  of  the  earth.  This  expression  i 


nra/4  in  n/\niiinntinn  4V1  fka  i 


P'  = 


rtD 

rQ  cos  ft, 


to  obtain,  as  in  Eq.  ( 1 5), 


fN=f 


(34) 


(35) 


The  takeoff  angle  0O  must  bo  found  by  interpolation. 

An  example  of  the  accuracy  available  in  this  type  of 
inversion  is  given  by  figure  5 from  Smith  [1970] . Using 
the  electron-density  profile  on  the  left  (t),  he  calculated 
an  oblique  ionogram  by  means  of  ray  tracing.  Points  were 
then  scaled  from  the  oblique  trace  and  an  n version  per- 
formed. The  result  is  plotted  (o)  on  the  vertical  profile. 

It  is  apparent  that  no  downward  reflection  can 
occur  over  the  height  range  where  a profile  is  reentrant. 
While  the  above  ‘straight-line”  approach  can  be  employed 
as  though  no  reentrance  existed,  Smith  shows  how  to 
allow  for  such  profiles  and  what  assumptions  are  required 
for  satisfactory  results.  Since  no  portion  of  *he  oblique 
trace  can  be  directly  associated  with  the  reen  '.rant  region, 
any  information  about  that  region  must  come  from  the 
increase  in  virtual  delay  above  that  height. 


Figure  5.  - Example  of  inversion  of  oblique  ionogram 
using  lamination  technique  [Smith,  1970] . 


George  [1970]  has  suggested  a somewhat  similar  method  that  follows  closely  the  vertical-incidence  work  of 
Jackson  [1956] . In  this  instance  the  angle  of  the  ray  path  is  supplied  more  explicitly  from  an  empiricized  equation 
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D 


(36) 


- sin 


2r0  sin  ( D/2r0 ) 
P'  + e 


lather  than  obtaining  it  by  interpolation.  The  term  e is  a function  of  the  virtual  depth  into  the  layer  at  the  frequency 
of  interest.  Both  of  the  lamination  methods  give  good  results  for  nonreentrant  profiles. 


Analytical  Models  of  Layer  Shape 

Usually  we  think  of  profile  inversion  as  the  process  of  determining  a more  or  less  completely  arbitrary  distribution. 
In  many  circumstances,  however,  a more  restricted  profile  is  still  satisfactory.  For  these  cases,  some  analytical  form  for 
the  electron-densitv  profile  must  be  assumed,  and  the  “inversion”  consists  of  evaluating  the  coefficients  or  scale  factors 
of  the  assumed  expression.  At  oblique  incidence,  two  factors  make  an  otherwise  straightforward  calculation  veiy  diffi- 
cult. First,  there  is  not  just  a single  equation  in  P'(f).  as  at  vertical  incidence,  but  a set  of  coupled  equations  m P’ 
and  D,  which  a.  easily  solved  simultaneously  only  for  the  case  of  a fiat  ionosphere.  Second,  the  accuracy  with  which 
the  record  must  be  scaled  to  obtain  realistic  values  of  layer  parameters  often  exceeds  that  available. 

Probably  the  most  frequently  assumed  layer  shape  for  such  scaling  is  parabolic.  The  three-parameter  parabo'ic 
model  consists  of  a height,  a critical  frequency,  and  a semithickness.  Practice  has  shown  that  unless  data  points  can 
be  scaled  in  the  vicinity  of  the  junction  frequency  or  on  the  high  ray,  the  estimates  of  critical  frequency  will  have  sub- 
stantial error. 

A frequently  used  means  for  mapping  critical  frequency  ove-  wide  geographical  areas  is  the  use  of  ground  back- 
scatter.  Again,  certain  assumptions  are  required  for  the  values  or  constancy  of  layer  height  and  thickness.  An  indica- 
tion of  the  type  of  accuracy  and  range  available  by  using  one-hop  ground  backscatter  is  given  by  Hatfield  [ 1 970] . 

The  use  of  ground  backscatter  data  to  obtain  more  complete  profile  data  has  been  discussed  by  Egan  [ 1 960]  and 
George  [ 1 970] . Measurement  difficulties,  however,  tend  to  limit  the  usefulness  of  the  more  detailed  inversions 
obtained  from  backscatter. 


LIMITATIONS 


Magnetic  Field  Effects 

The  inclusion  of  geomagnetic  field  effects  in  propagation  calculations  makes  tire  equations  far  more  complicated, 
and  for  this  reason  geomagnetic  field  effects  are  usually  ignored  whenever  possible.  One  advantage  of  oblique  ( as  op- 
posed to  vertical)  sounding  is  that  the  normally  higher  operating  frequencies  tend  to  reduce  the  effect  ot  the  geomag- 
netic field.  When  the  geomagnetic  field  is  included,  it  must  be  recognized  that  the  oblique  ray  encounters  continuous 
variations  in  the  magnetic  field  vector  along  its  trajectory.  Since  it  is  difficult  to  treat  changing  field  conditions  and 
yet  retain  some  generality,  the  angle  6 between  the  ray  and  the  magnetic  field  is  assumed  to  be  constant  and  equal  to, 
say,  an  average  value  in  the  vicinity  of  reflection. 

The  detailed  effects  of  the  geomagnetic  field  on  oblique  ionograms  may  be  found  in  Kopka  and  Moller  [19686] 
for  varying  path  orientations  and  locations. 

Another  source  of  error  due  to  the  magnetic  field  is  the  lateral  displacement  of  the  ray  from  the  no-field  trajec- 
tory. Titheridge  [1959]  has  examined  this  error  and  concluded  that  later?)  displacements  of  the  order  of  lo  km  can 
occur  only  at  frequencies  below  5 MHz. 
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Horizontal  Gradients  in  Electron  Density 

Perhaps  the  greatest  limitation  on  the  accuracy  of  inverting  oblique-incidence  ionograms  is  the  lack  of  constancy 
of  the  electron-density,  profile  over  the  reflection  region.  The  rays  are  reflected,  of  course,  in  a three-dimensional 
distribution  of  electron  density,  which  cannot  normally  be  described  by  a single  vertical  profile.  Although  one  might 
expect  to  perform  the  inversion  after  some  correction  for  known  horizontal  gradients,  such  a procedure  is  difficult 
to  employ  (not  to  mention  that  the  gradients  are  usually  unknown).  The  inclusion  of  such  gradients  would  no  doubt 
require  tedious  trial  and  error  using  ray-tracing  techniques. 

If  we  cannot  treat  the  gradients  quantitatively,  we  can,  however,  examine  the  type  of  error  they  might  produce 
in  a typical  profile.  Generally  speaking,  the  maximum  oblique  frequency  tends  to  be  determined  near  the  horizontal 
region  having  the  maximum  electron  density.  For  example,  for  a simple  linear  gradient  of  critical  frequency,  the 
maximum-frequency  ray  will  be  reflected  nearer  the  end  of  the  path  with  the  higher  density.  Furthermore,  the  max- 
imum frequency  will  exceed  the  maximum  frequency  that  would  be  observed  if  the  average  critical  frequency  existed 
all  along  the  path.  Conversely,  a simple  linear  gradient  in  height  will  reduce  the  maxime' . frequency.  Kopka  and 
MoUer  [1968a]  examined  some  specific  cases  of  each  type  of  gradient. 


Absolute  Time  Delay 

There  are  roughly  three  techniques  for  measuring  absolute  path  delay  by  using  oblique  ;ncidence  sounders.  The 
first,  and  probably  the  most  accurate,  is  a simultaneous  duplex  pulse  transmission  along  the  path.  The  average  of  the 
time  delays  measured  at  each  receiver  is  the  exact  path  length  - correct  to  the  accuracy  of  the  measuring  equipment. 
The  use  of  a transponder  falls  in  this  category. 

The  second  method  is  the  use  of  “constant”  time-delay  modes,  such  as  those  - opagated  via  the  normal  E layer. 
This  method  has  the  obvious  disadvantage  that  the  £-4ayer  trace  is  not  always  detectable,  especially  over  long  paths; 
also,  it  is  subject  to  f-layer  height  variations,  which  become  more  pronounced  when  the  height  variation  of  Es  is 
included.  The  accuracy  achievable  with  this  method  is  on  the  order  of  100  usee,  depending  on  the  length  of  the  path. 
For  a 3100-km  path  (2£  ),  the  variation  in  delay  is  100  usee  as  the  Zf-layer  height  is  varied  from  90  to  130  km. 

The  third  method  is  the  use  of  separate  stable  oscillators.  This  method  requires  that  the  standards  of  the  trans- 
mitter and  those  of  the  receiver  be  in  phase  or  that  their  clocks  be  exactly  synchronized.  Tne  error  in  measurement 
will  be  the  time  difference  existing  between  these  clocks  or  the  error  in  the  measurement  apparatus,  whichever  is 
greater.  If  the  actual  time  difference  between  clocks  is  unkno.'m,  the  rath  delay  will  be  no  more  accurate  than  the 
computed  delay  used  to  establish  the  original  synchronization. 


CONCLUSIONS 


The  inversion  of  oblique-incidence  ionograms  can  be  achieved  only  under  rather  restrictive  limitations  or 
assumptions.  The  most  serious  of  these  restrictions  is  the  absence  of  horizontal  gradients  in  electron  density.  The 
normal  curvature  of  the  ionosphere  may  be  allowed  for  by  using  correction  factors  in  the  oblique-to-vertical  trans- 
formation methods  or  by  using  the  lamination  methods.  With  the  assumption  of  a horizontally  stratified  ionosphere, 
many  approaches  are  possible,  including  oblique-to-vertical  transformation  or  direct  inversion  of  the  oblique  trace. 
The  inaccuracies  due  to  the  magnetic  Held  and  to  uncertainty  ir.  absolute  time  delay  lead  to  smaller  errors  for  most 
situations.  With  care  in  time-delay  scaling  and  in  the  selection  of  oblique-to-vertical  transf  jrmation  methods,  a 
vertical-incidence  ionogram  can  be  derived  that  will  accurately  portray  the  electron-density  profile  if  the  horizontal 
gradients  are  small  over  the  reflection  region. 
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Chang:  In  your  flat-earth  ionosphere  approximation,  the  profile  you  got  was  far  from  the  true  one.  Could  you  use 

the  modified  refractive  index  concept  to  go  from  that  profile  to  the  original  profile?  Booker  [1946] , used  this 
concept. 

Watt:  That’s  similar  to  the  method  of  Newbern  Smith. 

Reinisili.  The  functions ' ou  gave  for  h'  seem  to  ali  be  single  valued;  for  example  for  one  frequency  there  is  one 
time  delay,  while  the  model  oblique  lonograms  show  the  typical  nose.  The  Peterson  ray  is  attributed  to  the 
higher  trace.  How  have  you  taken  care  of  that  ’ 

Watt:  The  h'  functions  occur  alter  transforming  from  oblique  U>  vertical  incidence  and  are  single  valued. 

Unidentified  speaker:  Is  the  horizontal  gradient  problem  alleviated  if  you  have  several  receivers? 

Watt:  You  can  learn  more  about  horizontal  gradients,  but  1 don’t  have  a quantitative  feel  for  the  extent. 

Re.nisch:  Would  pulses  help  to  separate  the  multipath  modes  compared  to  a chirp  sounde, ' 

Watt:  You  can  separate  multipath  modes  with  chirp  sounders. 
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5.  PARTICLE  SCATTERING: 


CLASSICAL  AND  QUANTUM  MECHANICAL 


This  chapter  is  concerned  with  the  scattering  of  particles,  either  by  each  other  or  by  some  scattering  center. 

In  contrast  to  the  previous  chapters,  the  probing  “signals”  are  particle  beams,  at  least  for  the  classical  case,  and  the 
measured  data  are  the  number  of  scattered  particles  emerging  per  unit  time  in  a given  direction,  per  unit  solid  angle. 
The  aim  of  the  inverse  problem  is  to  infer  the  interparticle  force,  or  potential  energy,  as  a function  of  distance  from 
the  scattering  center. 

In  general,  the  problem  here  is  more  complex  and  difficult  than  those  in  the  other  disciplines.  The  mathe- 
matical details  and  inversion  techniques  are  significantly  more  sophisticated.  Curiously,  the  laboratory  experimen- 
tal phase  of  this  work  is  not,  to  date,  as  sophisticated  as  the  theoretical  work,  but  the  mathematical  methods  de- 
scribed here  have  been  applied  to  several  other  inversion  disciplines,  in  particular  to  electromagnetic  scattering 
(Chapter  6)  and  to  seismology  (Chapter  7). 

R.  G.  Newton  organized  and  chaired  the  session  devoted  to  particle  scattering. 


■ ■ t .w- 


INTRODUCTION  TO  THE  INVERSE 
QUANTUM  SCATTERING  PROBLEM* 

\ 

Roger  G.  Newton 

’ Indiana  University 

ft  4 _ 

| NT3-11608  ABSTRACT 

\ 

The  averse  scattering  problem  is  introduced  in  the  context  of  the  Schrodinger  equation.  The  emphasis  is  on 
; ,he  GeD'tti  Levitan  approach  to  the  nonrelativistic  problem  at  fixed  angular  momen'um. 

'-i 

INTRODUCTION 


: < 
. -;:S 


■V  I 


The  experimental  situation  under  discussion  in  this  chapter  is,  schematically,  the  following.  A colimated  beam 
of  particles  of  given  momentum  is  directed  at  a thin  foil  of  target-particles  that  may  be  considered  at  rest.  The  force 
exerted  by  the  target  on  the  beam  particles  deflects  the  latter  from  their  original  direction,  and  they  emerge  at  various 
angle  where  they  are  counted  by  detectors.  Thi  atio  of  the  number  of  scattered  particles  emerging  per  unit  time  in 
a give  i direction,  per  unit  solid  angle,  to  the  incident  flux  and  to  the  target  density,  is  called  the  differential  scattering 
cross  sechon.  It  is  theoietically  most  directly  described  in  the  center-of-mass  coordinate  system  of  the  missile  and 
target  particles,  rather  than  in  the  laboratory  system  in  which  initially  the  target  is  at  rest.  The  two  are  connected  by 
a simple  transformation. 

The  dynamics  of  the  " eraction ! ? tween  the  beam  and  target  particles  may  be  described  by  classical  mechanics 
if  the  beam  energy  is  high  -nough  and  not  sufficiently  well  defined,  or  more  generally  by  quantum  mechanics.  In 
either  case  :he  aim  of  the  inverse  problem  is  to  infer  from  the  scattering  data  (the  differential  cross  section)  the  inter- 
particle  fo.ee,  or  potential  energy,  as  a function  of  distance  (the  “profile”).  The  “profile”  in  this  case  is  not  generally 
expected  to  exhibit  as  much  detail  and  small-scale  structure  as  in  the  cases  discussed  in  previous  chapters.  On  the 
other  hand,  the  path  from  the  data  to  the  function  sought  is  very  much  longer  and  the  inference  more  indirect.  As  a 
consequence,  the  mathematical  apparatus  needed  is  much  more  sophisticated. 

The  invei  se  scattering  problem  in  classical  mechanics  was  treated  in  a paper  by  Keller  et  al.  [1956]  and  it  leads 
to  Abel’s  integral  equation,  v'lich  we  have  already  seen  a number  of  times.  In  view  of  the  formal  similarity  between 
classical  mechanics  and  ray  optics,  the  appearance  of  Abel’s  equation  in  both  is  no  surprise.  Here  we  turn  directly  to 
the  quantum  meeb' .-icj  dynamics,  and  to  simplify  matters,  we  shall  restrict  the  discussion  to  particles  without  spin 
at  nonrelativistic  vei  xities. 


QUANTUM  - MECHANICAL  DYNAMICS 


The  scattering  of  a beam  of  spinless  particles  by  a rotationally  symmetric  center  of  force  is  described  by  the 
time-independent  Sc’'roidinger  equation,  which  in  appropriate  units  reads 


•Work  S' 'poor  ted  in  part  by  the  National  Science  Foundation  and  the  U.S.  Army  Research  Office,  Duriam, 
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[-VJ  + K(r)]  4/  = E\p 

where  V2  is  the  Laplace  operator  in  three  dimensions,  E is  the  energy  of  the  particle,  and  V(r)  is  the  potential 
energy,  assumed  spherically  symmetric  and  rapidly  decreasing  to  zero  at  large  r.  The  asymptotic  form  of  the  solu- 
tion \p  for  lar(  ? t is 


4 / ~-e'k‘r  + r 1 f( k',k) 


where  hk  is  the  momentum  of  the  particle,  and  k'  = kt/r.  (We  write  r for  Iri.)  The  connection  between  the 
energy  and  the  value  of  k = ikl  is  (in  our  units)  E ~ k2 . You  may,  of  course,  equally  well  think  of  E as  the 
frequency  of  the  wave,  and  of  k as  the  wave  vector,  with  the  wave  length  X = 2n/k. 

The  measured  differential  cross  section  for  scattering  from  a beam  in  the  direction  k to  a counter  in  the 
direction  k'  is  given  by 


do  , i 

— = l/(k',k)l 
ds2 


Owing  to  the  rotational  invariance  of  the  scattering  center,  f depends  only  on  E and  on  the  angle  0 between  k 
and  k': 


/(k',k)  = A(E,x)  x = cos  0 - 


The  usual  procedure  is  to  solve  the  Schrodinger  equation  in  some  way-exactly,  numerically,  or  approximately- 
to  extract  f from  , a;  d thereby  to  calculate  the  cross  section.  The  inverse  scattering  problem  is  posed  by  assum- 
ing do/dSi  to  be  ziven  as  a function  of  E , or  x,  or  both,  and  then  to  ask  whether  there  is  a (reasonable)  function 
V(r)  that  produces  ;t  via  the  Schrodinger  equation,  and  if  so,  whether  it  is  unique  and  how  it  can  be  calculated. 

Assuming  do/dfi  to  be  given  as  a function  of  the  angle  or  x for  0 < x < 1,  the  first  problem  is  to  deter- 
mine the  complex  function  A (E,  x)  from  a knowledge  of  its  absolute  magnitude.  The  most  direct  attack  on  this 
problem  is  made  by  means  of  the  so-called  “generalized  optical  theorem,”  or  unitarity  of  the  S matrix.  It  follows 
from  the  conservation  of  fiur  implied  by  the  Schrodinger  equation  that 


Imf{ k\k)  = — ( dSmk'',k')/(k’',k) 

4tr  J 


where  dfl"  is  the  solid-angle  clement  of  k”,  and  Im  denotes  the  imaginary  part  of  the  complex  function.  If  we 
write  (at  fixed  E ) 


kA  (E,  x)  = F(x)e^(x)  0 < F(x) 


then  this  equatior  reads 
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F(y)F(z) cos  [0(y)  - 0(z)] 


F(x)sin  <p(x)  = — 


J 


dydz 


(1  -a:2  -y2  -z2  + 2xyz) 


and  the  integration  extends  over  the  interior  of  the  ellipse  where  1 -x2  -y2  -z2  + 2xyz  > 0.  This  equation  may 
be  regarded  as  a nonlinear  integral  equation  for  sin  <f>(x)  if  F(x)  is  given.  [Martin,  \969; Newton,  1968],  Both 
Schauder’s  fixed-point  theorem  and  the  principle  of  contraction  mapping  have  been  applied  to  this  problem,  with 
the  following  result.  If 


}F{y)F(z) 

dy  dz  r ■ - , — < 1 

27tF(x)(l  - x2  -y2  - z 2 + 2 xyz) 

for  all  x,  0 < x < 1,  and  F(x)  satisfies  a Holder  condition,  then  a solution  sin  0(x)  exists.  If  Q(x)  < 21'2 , 
the  solution  is  unique  and  may  be  obtained  by  iteration.  [It  was  shown  by  Martin,  1969,  that  for  Q{x)  < 0.79  the 
solution  is  unique.]  It  is  also  known  from  explicit  examples  [Crichton,  1966]  that  for  some  F(x)  with  Q(x)  > 1 
a solution  exists  and  is  not  unique.  The  general  situation  for  Q(x)  is  unknown.  It  is  also  not  known  whether  the 
gap  between  2I/2  (or  0.79)  and  1 is  real  or  merely  a technical  difficulty.  For  further  remarks,  see  Fftimiu  [1970], 
Goldberg  [1970] , and  Gerber  and  Karplus  [1970, 1971] . 

Let  us  assume  now  that  the  scattering  amplitude  A (E,  x)  has  been  obtained  from  the  experimental  data.  We 
may  expand  its  dependence  on  x in  a Legendre  series: 

OO 

A{E,x)  = ai(E)Pt(x) 

E=0 

in  which  the  Pg(x)  are  Legendre  polynomials.  The  principal  virtue  of  this  expansion  is  that  it  allows  us  to  incor- 
porate the  restriction  imposed  on  A (£,  x)  by  the  generalized  optical  theorem  in  a very  simple  manner:  The 
coefficients  ag  must  have  the  form 


ae  = (26+  l)(/*r*  (em*) 

where  6g  is  & real  number.  It  is  called  the  Eth  phase  shift  and,  of  course,  depends  on  E.  To  s<y  that  A(E,x)  is 
given  is  therefore  equivalent  to  saying  that  the  phase  shifts  are  given. 

At  this  point  the  inverse  problem  may  be  posed  in  two  different  ways.  The  first  is  to  assume  that  a single 
phase  shift  6g(£)  is  given  as  a function  if  E,  for  all  positive  E.  The  second  is  to  suppose  that  at  one  value  of  E 0, 
all  phase  shifts  are  given.  The  first  was  attacked  much  earlier  and  there  is  much  more  literature  on  it  (see  appendix) 
than  on  the  second,  even  though  from  a practical  point  of  view  the  second  way  of  posing  the  inverse  scattering  problem 
is  of  greater  interest.  I shall  discuss  only  the  first,  the  so-called  “inverse  scattering  problem  at  fixed  angular  momentum." 
The  second,  the  “inverse  scattering  problem  at  fixed  energy,”  is  discussed  by  Sabatier  in  the  next  paper. 

Before  proceeding,  there  is  a more  general  remark  that  should  be  made.  It  would  be  most  natural  to  ask: 

What  if  A ( E , x)  were  given  as  a function  of  both  E and  x.  Or,  what  if  all  phase  shifts  were  given  as  functions  of 
E for  all  positive  £?  We  shall  see  that,  at  least  within  a certain  rather  large  class  of  potentials,  a single  phase  shift 
as  a function  of  £ determines  an  n-parameter  family  of  potentials  (with  n <*>).  Hence  the  prescription  of  6g(£) 
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for  one  i determines,  for  each  at  most  an  n-parameter  family  of  phase-shift  functions  fig'(E).  Thus  while 

5g(£"),  for  any  one  { can  be  prescribed  (within  wide  lim’ts)  arbitrarily  as  a function  of  E , two  phase  shifts  of 
different  i values  are  very  much  restricted  in  their  possible  E dependence  by  the  requirement  that  they  be  pro- 
duced by  the  same  potential. 

A similar  situation  exists  for  the  inverse  problem  at  fixed  energy.  As  Sabatier  notes,  a set  of  phase  shifts  at 
one  energy  determines  a certain  class  of  potentials.  A set  of  phase  shifts  at  another  en.  gy  also  determines  a class 
of  potentials.  The  intersection  of  these  two  classes,  for  arbitrarily  prescribed  sets  of  phase  shifts  is  not  krown,  but 
the  chances  are  that  with  probability  1 this  intersection  will  be  zero.  In  other  words,  when  all  6 g(£  ) are  given  for 
all  £,  a horrendous  existence  problem  arises.  For  this  reason,  one  usually  assumes  either  all  6g  atone  E,  or  one 
fig  for  all  £. 


THE  INVERSE  PROBLEM  AT  FIXED  ANGULAR  MOMENTUM 


We  are  concerned  here  with  one  particular  general  approach  to  the  inverse  problem  at  fixed  angular  momentum. 
There  is  another  class  of  methods  more  suitable  to  the  extraction  of  partial  information  from  partial  input,  rather  than 
from  the  full  knowledge  of  a 5 g(£)  for  all  positive  E as  discussed  by  Calogero  in  a later  paper. 

If  the  solution  0(r)  of  the  Schrodinger  equation  is  subjected  to  Legendre  expansion  of  its  directional  depen- 
dence on  r relative  to  the  direction  k of  the  incoming  beam 


*(r)  = 


5^  (2C+  l)r~'  ^(r)Pg(k-r) 


then  the  radial  function  0g(r)  must  satisfy  the  radial  Schrodinger  equation 

-0S”  + C(C+  !)£“*  0g  + K(r)0g  = £0g 


which  is  an  ordinary  linear  homogeneous  second-order  differential  equation  with  a regular  singular  point  at  r = 0. 
(It  is  assumed  that 


Urn  r2  V(r)  = 0 
r-0 


Otherwise,  this  statement  would  not  necessarily  be  true.)  There  are  two  classes  of  solutions:  The  regular  solutions 
and  the  irregular  solutions  (all  with  respect  to  the  point  r = 0).  A regular  solution  is  uniquely  determined  by  a 
boundary  condition  at  the  origin-for  example, 

Urn  r-£_1  0n(r)(28  + 1) !!  = 1 
r-0 


For  fi  * 0 (the  so-called  “1  wave”)  to  which  we  shall  for  simplicity  from  now  on  restrict  ourselves,  this  means 


0(0)  = 0 0'(O)  = 1 
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Every  other  regular  solution  is  a multiple  of  this  one.  This  includes  the  physical  radial  wave  function.  The  asymptotic 
behavior  of  a regular  solution  for  large  r is  given  by 


a.  A sin  (At  + 6) 


where  6 is  the  (s  wave)  phase  shift. 

In  this  context,  then,  the  inverse  scattering  problem  is  posed  by  assuming  the  asymptotic  phase  6 (£)  ot 
0(r)  given  for  all  positive  E,  to  determine  V(r)  in  the  radial  Schrodinger  equation.  Mathematically,  therefore, 
it  is  an  inverse  Sturm-Liouville  problem  on  a semi-infinite  interval  0 < r < “ . We  are  given  asymptotic  infor- 
mation about  the  regular  solution,  as  a function  of  the  spectral  parameter,  and  we  are  asking  for  the  reconstruction 
of  the  differential  equation  (in  a specifically  restricted  class). 

The  attack  on  this  problem  involves  two  essentia!  steps.  As  solutions  of  a Sturm-Liouville  problem  on  a semi- 
infinite interval  the  functions  <p(E,  r)  form  a complete  set.  This  statement  may  be  written  in  the  symbolic  form  of 
a Stieltjes  integral 


f~ dp  (E)<t>{E.r)<t>  (£,/)  - 6 (r-r) 

*-oo 


The  spectral  function  p(£)  has  a finite  number  of  discontinuities  at  the  discrete  (point)  eigenvalues  of  the  Strum- 
Liouville  problem  (at  negative  values  of  E)  and  it  is  continuously  increasing  for  £ > 0.  The  point  eigenvalues  are 
the  bound  states  of  the  system,  and  if 


dr  r\V\ 


< °° 


then  their  number  is  finite.  Furthermore  they  cannot  be  positive.  The  positive  spectrum  is  absolutely  continuous. 
The  two  steps  in  the  inverse  problem  are  0 ) 6 (£)-*•  p(£),  and  (2)  p(£)  -+  F(r). 


Let  us  define 
integral  equation 


Stepl:  S(£)  -*•  pi£) 

X(£,  r)  to  be  a regular  solution  of  the  radial  Schrodinger  equation  that  satisfies  the  scattering 


X'Xv+GtVx 


where 


XgiE)  = k 1 sin  At 

and  Gq  is  the  resolvent  of  the  free  Hamiltonian  operator  on  the  upper  rim  of  its  cut, 
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Go(F)  - (E  + io  • H0) 


Hn  = 


-dJ 

dr1 


Its  kernel,  the  free  Green's  function,  can  be  written  down  explicitly, 

GpiE;r,r')  = ~k~ 1 sin  kr<e‘*r> 


where  = min(r,  r)  and  r>  - maxf,  r).  for  the  bound  states  En  we  will  assume  x to  be  normalized  to  unity: 

f °°  I 1 2 

JQ  dr  |xnW|  = 1 


Now,  since  x smd  0 are  both  regular  solutions  of  the  radial  Schrodinger  equation,  they  must  be  linearly 
dependent:  0(r)  = fx(r).  From  the  integral  equation  for  x one  easily  finds  that  as  r -*  0 


xW  ~ r 


1 - 


/ d reikr' 


V(r’Mr') 


ermparison  of  which  with  the  boundary  condition  for  <p(r)  shows  that 


/ = 1+  I d r eikr  V(ryt>(r) 

'o 


On  the  other  hand,  as  r **  » we  find  from  the  integral  equation 


xW~ 


2 ik 


JT 


1-2/  I dr  sin  kr'  V(r')x(r') 

f 0 


eikr  _ e~ikr 


and  consequently,  as  r ■ 


X(f)~ 


because  0(r),  being  the  solution  of  a real  differential  equation  with  real  b<  ndary  conditions,  is  real  (for  real  k). 

It  follows  that  fTif  - that  is,  arg/  = -6 . The  function  f[E)  is  calieu  the  Jost  function  and  it  ran  be  shown 
to  be  equal  to  the  Fredholm  determinant  of  the  integral  equation  for  x 
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On  the  other  hand,  the  completeness  of  the  spectrum  of  the  Sturm-Liouville  operator  H = -d2  /dr2  * V(r)  is 
expressed  in  terms  of  x as  follows: 


dEkx(E,r)\*{ES)  = 6(r-r') 


if  En  are  the  point  eigenvalues  of  H.  Comparison  with  the  spectral  resolution  in  terms  of  0 shows  that  therefore 


where 


dp(£) 

d£ 


k_  \ 

» r 1 


N 

HE-En) 

n-1 


E>  0 


E<  0 


Nn'  = ft  d r[0(£„,r)]J 


Thus,  the  connecting  link  between  the  spectral  function  p(jE)  and  the  asymptotic  phase  8(E)  is  the  Fredholm 
determinant  (or  Jost  function)  /.  The  crucial  property  cf  /(£)  that  allows  us  to  determine  l/(£)l  from  a knowl- 
edge of  its  phase  ~6  (E)  is  its  analyticity  and  asymptotic  behavior. 

Since  the  resolvent  Go(E)  of  H0  is  an  analytic  (operator-valued)  function  of  E regular  everywhere  in  the 
E plane  cut  from  0 to  °°  and  tends  to  zero  there  at  infinity,  f(E)  is  an  analytic  function  of  E regular  in  the  cut 
E plane  and  tends  to  1 there  at  infinity.  Furthermore,  f(E)  has  the  reality  property 


r = f(E~io)  = r(E  + io) 


Therefore,  if  / has  no  zeros,  its  logarithm  satisfies  a “dispersion  relation” 


log  I /(£) I = —P 
rr 


dE1 6(E) 
E-E 


(i.e.,  a Hilbert-transform  relation  between  its  real  and  imaginary  parts)  where  P denotes  Cauchy’s  principal  value  If, 
on  the  other  hand,  f(E)  has  zeros  in  the  complex  plane,  then  these  are  the  point  eigenvalues  En  of  H.  In  that  case 
we  form  the  function 


/ (E)  s /(£)  /7  “ 

k-kn 

which  has  no  zeros  and  hence  satisfies  the  above  Hilbert  transform  relation.  We  are  therefore  free  to  prescribe  inde- 
pendently both  the  bound-state  energies  Et t,  and  the  phase  shift  6(£),  and  we  can  calculate  \f(E)  I explicitly. 


5-8 


Thus  the  positive  energy  part  of  the  spectral  function  p(E)  is  determined  uniquely  by  the  phase  shift  6 (E) 
and  the  bound  states  together.  The  negative-energy  part  then  still  contains  a positive  normalization  parameter  for 
each  bound  state.  Thus,  prescription  of  6(E)  and  of  the  N bound-states  determines  an  W-paiameter  family  of 
spectral  functions. 


Step  2:  p(E)  - V(r) 

We  now  turn  to  the  second  step,  from  p(E)  to  V.  By  a method  similar  to  the  proof  of  the  completeness  of 
the  functions  0,  one  can  show  that  if  V^' ) is  an  arbitrary  comparison  potential  and  0^')  and  p(')  are  the  corre- 
sponding regular  solution  and  spectral  function,  then  there  is  an  integral  representation  of  <t>(E,r)  of  the  form 


<t>(E,r)  = 


0(l>(£»  + 


d r' K(r,r)4'\E,r) 


where 


K(r,  r)  = J Ah(E)<t>(E,  r)0(l  \e.  r) 


with 


h(E)  - p^(E)  - p(E) 


It  is  easily  seen  that  the  kernel  K(r,  r)  satisfies  the  partial  differential  equation 


Lar2 


- F(r)J  /f(r,  r')  = > (r  )J  JC(r,  r ) 


One  then  verifies  by  means  of  this  equation,  the  Schrodinger  equations  for  0 and  0^*  \ and  of  the  above  integral 
representation,  that  K(r,r')  must  satisfy  the  boundary  conditions 


2 — K(r.r)  = V(r)  - K<‘ ) (r)  K'r.Q)  = 0 
dr 


Next  one  inserts  the  integral  representation  for  0 in  the  definition  of  K(r,  r'),  and  one  obtains  a linear  integral 
equation  for  K(r,r), 


K(r,r)  = g(r,  r)  + 


f dr"ti(r,r" 

A> 


)g(r"S) 
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where 


g(r,  r)  * J d h(E)4>('  > (£,  r)*<‘  > (£  r) 


This  is  the  Gelfand-Levitan  equation.  It  can  be  shown  under  very  general  conditions  to  have  a unique  solution.  Thus 
the  step  from  p to  V is  complete,  as  follows. 

Adopting  an  arbitrary  comparison  potential  K^1 ) (for  example,  V^1  ^ * 0),  we  calculate  gi>\  r)  from  the 
given  p(£).  Then  the  Geliand-Le,'itan  equation  must  be  solved,  and  from  K (r,  r)  we  obtain  K(r)  uniquely. 

We  summarize  the  inversion  procedure  schematically  in  the  following  way: 


(1)  (2)  (3)  (4) 

do 

* A(E,  cos  6)  -*  6g (£)  -*  pg(£)  -»  V(r) 


in  the  first  step  existence  and  uniqueness  are  assured  so  long  as  the  cross  section  is  smooth  and  small  enough.  Step  (2) 
is  straightforward  and  unique.  In  step  (3)  existence  is  assured  by  mild  conditions  on  the  functions  6g(£),  a d it  is 
unique  if  there  are  no  bound  states  of  angular  momentum  8 . Otherwise,  the  binding  energies  can  be  assigned  inde- 
pendently, and  there  is  a “phase  and  binding-energy  equivalent”  family  of  spectral  functions  pp(£),  with  as  many 
parameters  as  bound  states.  Step  (4)  is  unique,  under  mild  restrictions,  and  existence  is  assured. 

The  final  question  that  arises  is  whether  we  a' . assured  that  the  cycle  is  always  doted.  In  other  words:  We  start 
with  a certain  class  1 of  potentials  and  prove  certain  properties  of  the  phase  shifts- say,  that  they  are  in  class  2.  If  we 
now  start  with  any  phase-shift  function  8 g(£)  in  class  2,  the  inversion  procedure  leads  to  potentials  in  some  class  1'. 
We  would  call  the  procedure  closed  if  the  class  2 were  known  f<  * which  1' 1 . It  would  be  optimally  closed  if  the 
class  1 were  known  for  which  2 is  such  that  1'  = 1 . Tin;  an*'  'tr,  -o  these  questions  are  not  known. 
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APPENDIX 


The  first  attempts  at  a solution  of  the  inversion  problem  here  discussed  were  made  by  Froberg  [1947-31 ) and 
Hylleraas  [1948] . It  was  then  shown  by  Bargmann  [1949]  that  the  phase  shift  alone  does  not  necessarily  uniquely 
determine  the  potential.  The  uniqueness  question  was  resolved  by  Levinson  [1949] , Marchenko  [1950] . and  Borg 
[1949]  (see  also  Borg  [1946] ).  The  previous  work  for  Froberg  and  Hylleraas  was  amended  in  the  light  of  the  new 
knowledge  by  Holmberg  [1952] . New  construction  procedures  were  then  invented  by  Jos  and  Kohn  [1952] . For 
motivation  see  also/ost  [1947]. 

Meanwhile  the  Geltand-Levitan  equation  had  been  introduced  into  the  inverse  Sturm-Liouville  problem  by 
Gel’fand  and  Levitan  [1951],  It  was  th*i  applied  to  the  inverse-scai  {“ring  r oblem  by  Jost  and  Kohn  [1953]  and 
Levinson  [ 1 9.53] . Since  that  time  the  problem  has  given  rise  to  a rather  extensive  Russian  mathematical  literature 
Chudov  [1949  2nd  ;9i6] ; Krein  [1951-58]  \Marchenko  [1952-60] ; Berezanskii  [1953-58]  ;Block  [1953]  \Firsov 
[i'o3j ; Stashevskaya  [1953] ; Volk  [1953] ; Neigauz  [1955] ; Levitan  [1956-64]  ,Fadu^er  [1956-64] ; Agranovich 
and  Mcrtuenko  [19 57 -63];  Gohberg  and  Krein  [1958] ; Gasymov  [1963]  \F^starev  [1964] ; Lavrentev  [1964  and 
1956]  Levitanand  Gasymov  [1964] , Kac  [1965]  ,Gugushvili  [1970] . See  also  Ambarzi  mian  [ 1 929] , Jauho 
[1950J;Owm  [ 1 9551 ; A'a^  [1955  and  1961]  ,Ohmura  [1956];./osf  [1956];  wton  [1956]  \ Friedman  [1957]; 
Tietz  [1959] . Levitan  and  Sarksyan  [ 1 960] ; Martin  [ 1 96 1 j ; Petras  [1962] ; Blazek  [1962-66]  .Hyl'eraas  [1964]; 
Pearce  [!964 ],Ramm  [1964] ; Ljance  [1964  and  1 966] ; Swan  and  Pearce  [ 1 966] ; Lax  and  Phillips  [ 1 960 j ; 
Portinari  (1966  and  1967]  \Prosse,  [1969] ; O'Brien  and  Bernstein  [1969] ; Newton  [1966  and  1970] . 

A more  genera!  approach  started  with  the  inversion  of  the  electromagnetic  reflection  problem  and  led  to 
application  in  the  full  three-dimensional  Schrodi.iger  equation;  Kay  and  Moses  [1955-61] ; Moses  [1956] ; see  also 
Faddeev  [1966], 

Generalizations  of  the  Gellfand-Levitan  ai.J  Marchenko  procedures  to  coupled  equations  were  given  by  Newton 
and  Jost  [1955] ; and  Krein  [1956] ; to  the  scattering  of  spin  1/2  particles  with  a tensor  force  present  by  Newton 
,;955];  Agranovich  and  Marchenko  [1958];  Fulton  and  Newton  [ 1 956] ; Newton  and  Fulton  [1957] ; (see  also 
sec.  9 of  Newton  [1960] );  to  the  Klein-Gordon  equation  by  Corindaldesi  [1954] ; to  the  Dirac  equation  by  Prats  ard 
Toll  [1959] ; Gasymov  and  Levitan  [1966] ; and  Gasymov  [1968] ; to  both  relativistic  equations  by  Verde  [1958] ; 
to  coupled  channels  by  Cox  [1962] ; to  separable  potentials  by  Gourdin  and  Martin  [1957  and  1958] , Ondan  [1958 
and  1967] ; Bolsterli  and  Mackenzie  [1965] ; Mills  and  Reading  [1969] ; Fiedeldey  [1969]  ;and  Tabakm  [1969] ; 
to  energy-dependent  potentials  by  Malcenko  [1966] ; and  to  charged  particles  by  Swan  [1967] . 

For  some  applications,  see  Newton  [1957] , Gardner  et  al.,  [1967]  ‘.Buck  [1969 ] \ Fiedeldey  [1970) ; Fuda 
[1970]  ;Srivastava  [1970]  .Child  [1970] . 
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DISCUSSION 


Unidentified  speaker:  This  development  is  a very  pretty  theoretical  one.  This  inverse  problem  demands  that  you 

know  the  spectral  weight  function  for  all  values  of  the  energy,  but  when  you  measure  it  in  the  physical  labora- 
tory you  are  bounded  by  practicle  situations.  How  stable  is  this  process  against  uncertainties  or  incompleteness 
in  the  data'* 

Newton:  Net  much  can  be  said  about  that.  Of  course,  you  know  the  data  only  up  to  a finite  energy  and  then  not 

exactly.  There  are  some  things  that  can  be  said  about  the  stability  of  the  process  with  resoect  to  small  pertur- 
bations at  a finite  energy,  but  the  main  limitation  of  the  method  is  that  you  have  to  know  the  phase  shift  up 
to  infinite  energy. 

Unidentified  speaker:  c 'ppose  you  know  it  only  to  a finite  energy,  what  class  of  potential  can  you  fit  by  a class  of 

phase  shifts  that  is  given  only  up  to  a finite  energy? 

Newton:  That’s  a difficult  question  and  not  nv,  Ji  is  known.  What  you’d  like  to  know  are  certain  specific  features 

of  the  potential.  How  does  it  behave  asymptotically?  Can  you  tell  whether  the  potential  goes  down  exponen- 
tially for  large  distance?  If  so,  how?  The  answer  there  is  discouraging.  If  the  potential  goes  down  exponen- 
tially, then  the  S matrix  is  an  analytical  function  in  a strip  around  the  real  axis.  Suppose  for  a very  large  energy, 
I change  the  phase  shift  by  a tittle  kink-that  is,  I give  its  first  derivative  a discontinuity-then  the  S matrix 
cannot  possibly  be  analytic  in  that  strip  any  longer  and  therefore  the  potential  cannot  go  down  exponentially. 

Moses:  Dr.  S.  C.  Wang,  MIT,  tried  some  numerical  experiments  on  the  stability  of  the  Gelfand-Levitan  equation. 

He  varied  the  normalization  constants  a little.  It  is  quite  stable.  I also  tried  to  make  nonanalytic  scatter 
operators,  jumps,  s-ep  functions,  etc.  I didn't  solve  them  although  I think  it  is  solvable.  I think  you  will  get 
unitary  scattering  operators  and  so  on,  but  I think  you  get  singular  potentials  in  the  finite  domain. 

Parker:  Even  in  the  ideal  case,  you  still  nave  an  n-parameter  ambiguity,  a nonuniqueness— the  number  of  bound 

state*.  In  the  case  where  Q > 1,  what  is  thle  parameterization?  Is  it  < ountable,  or  nondenumerable,  or  how 
bad  is  the  nonuniqueness  when  you’ve  got  counter  examples? 

Newton:  We’re  talking  about  two  different  things.  You’re  talking  about  the  inverse  problem  from  the  cross  section 

to  the  amplitude.  That  has  nothing  to  do  with  the  number  of  parameters.  In  the  example  that’s  known,  it's  a 
threefold  ambiguity.  It  was  generated  numerically.  Somebody  took  a cross  section  and  searched  for  different 
amplitudes  to  fit  it.  He  found  three  solutions  on  a computer. 

Moses:  John  Lamont  and  1 looked  at  this  problem,  but  we  found  that  if  you  did  a measurement  through  slits  you 

can  find  complex  amplitudes  from  the  corresponding  cross  sections. 

Newton:  There  is  also  another  scattering  experiment  in  principle,  to  determine  the  phase  of  the  amplitude,  based  on 

the  Hanbury-Brown  and  Twiss  effect  of  intensity  interferometry.  Whether  this  is  feasible  in  the  particle  domain 
is  doubtful. 

Pninney:  1 *»n  concerned  with  the  existence  of  states  that  are  bound  at  positive  energies  and  can  tunnel  out.  I 

imagine  these  are  inaccessible  to  experimental  data  with  experimental  errors.  How  are  these  viewed 
practically? 

Newior:  There  are  two  kinds  of  such  states.  One  is  simply  a resonance,  a narrow  peak  in  the  cross-section,  which 

can  be  handled  by  the  inversion  method.  The  othe  vmg  is  actual  bound  states  at  postive  energy.  For  elastic 
scattering  by  'ocal  potentials  that  is  impossible.  For  nonlocal  potentials,  or  a three-body  problem,  this  is 
possible.  They  may  cause  difficulties. 
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Ptiinney:  I am  concerned  with  very  broad  resonances.  I have  a seismologjcal  analog  concerning  seismic  body 

waves  that  are  observed  at  a variety  of  distances  at  a given  frequency.  These  signals  illuminate  other  regions 
of  tHe  earth  but  are  not  permitted  to  enter  the  earth's  core  unless  tunneling  is  permitted.  From  a geometncal 
optics  view  *here  is  no  penetration.  If  the  core  had  a sufficiently  low  velocity  thit  the  states  were  perfectly 
bound  and  the  core  actually  had  resonances  that  could  not  be  observed,  I would  see  where  the  nonuniqueness 
lay.  We  are  unable  to  illuminate  these  regions  of  the  core,  anil  yet  the  states  that  exist  in  the  core  are  essen- 
tially resonances  and  the  effects  of  the  core  are  down  in  the  noise. 

Mew ton:  The  difference  is  that  you  don't  ask  for  specitic  mechanisms.  You  are  simply  taking  the  phase  shift  as  a 

function  ot  the  energy  for  granted.  Your  problem  is  much  more  complicated. 

Unidentified  speaker:  I would  like  to  make  a comment  on  the  application  to  seismology . The  seismological 

problem  was  modeled  at  Lincoln  Labs  some  years  ago  as  a one-dimensional,  layered  model.  In  that  case,  the 
inverse  problem  is  like  the  problem  of  a transmission  line  in  which  the  impedance,  or  index  of  refraction  is 
discontinuous.  The  kinds  of  effects  that  you  look  for.  leaky  states,  etc.,  can  be  taken  into  account  by  this 
same  algorithm  but  with  a different  class  of  potentials. 

Unidentified  speaker:  Yesterday  we  heard  problems  concerning  a complex  index  of  refraction,  which  would 

correspond  to  a complex  potential  here,  including  absorptive  effects.  Can  you  generalize  your  procedure  to 
handle  that? 


Newton:  Yes. 
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ABSTRACT 


A short  review  is  given  of  the  methods  of  solution  of  the  inverse  scattering  problem  at  fixed  energy  in  quantum 
mechanics.  The  attempt  has  been  made  to  cover  the  literature  up  to  April  197 1 . 


INTRODUCTION 


We  are  interested  in  scattering  experiments  of  a beam  of  particles  A at  a nonrelativistic  energy  L by  a target 
made  of  particles  B.  For  the  quantum  mechanical  description  of  the  system  A - B,  we  use  the  Schrodinger  equa- 
tion and  one  or  several  functions  depending  on  the  relative  distance  of  the  particles.  Our  inverse  problem  is  the  con- 
struction of  these  potentials  from  the  experimental  measurements. 

In  the  simplest  case-namely,  for  spinless  particles  and  isotropic  potentials-the  system  may  be  described  in  its 
own  center-of-mass  frame  by  the  Schrodinger  equation: 


A*(x)  + 11  - K(x)]  ¥(x)  = 0 


(1) 


The  above  equation  can  be  separated;  hence,  we  are  led  to  the  so-called  radial,  or  partial  wave,  equation: 


Dftjfir)  = i(i  f 1 )*{>) 


(2) 


where 


(3) 


The  partial  wave  \J/g(r)  is  the  solution  of  Eq.  (2)  whose  principal  part,  as  r goes  to  zero,  is  (1/2  r)®+ ' IX 1 /2 )/ rXB+3/2), 
so  that  for  V-  Owe  obtain: 
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O 

+2  (f)  = u£(r)  = 


(4) 


For  a physical  potential,  the  function  ^g(r)  must  have  the  following  asymptotic  behavior: 

/ 1 \ 

^£(r)  = A (,  sin  I r - — iti  + 5£J  + o (l)  r-»°° 

A sufficient  condition  for  the  existence  of  the  5g  is  that  the  fol'owi  g quantity  is  finite: 


(5) 


IFl  = p‘~c  iF(p)ldp  + /°V(p)ldp 

Jo  *0 


(6) 


where  a is  a general  length. 1 This  condition  defines  a set  O'  oi  potentials,  in  which  IFI  is  obviously  a norm. 
However,  the  set for  which  the  6£  are  defined  by  Eq.  (5)  is  much  larger  than  O :jP  contains,  for  instance, 
infinitely  repulsive  potentials.  The  cross  section  is  related  to  the  phase  shifts  by 


f(9)  = 2 (28  + l)e^8  sin  SgPgCcos 0)  (7) 

8=0 


o(9)  = lf(8)P 


(8) 


The  scattering  amplitude  f(6)  appears  in  the  asymptotic  behavior  of  the  wave  function 

i Hr)  = eik~,k'r-f(6)rleir 


(9) 


The  conditions  for  which  Eqs.  (7)  and  (9)  make  sense  are  not  necessarily  met  for  any  potential  of  Because 
of  the  bounds  of  the  Legendre  polynomials  for  sin  8 # 0,  a sufficient  condition  for  the  convergence  of  Eq.  (7)  is 
that  1 8£  1 be  bounded  by  Cfi-v:  Now,  according  to  a r uilt  of  Martin  (1965] , the  phase  shifts  are  bounded  for 

large  8 by 


C 


u|  (p)!K(p)ldp 


provided  that  this  quantity  goes  to  zero.  Therefore,  a sufficient  condition  for  a good  definition  of  f(6)  is  that 
p!/1  +c  I K(p)  1 be  bounded  for  large  p . 


1 Throughout  the  paper,  C is  a general  constant,  e an  arbitrarily  small  positive  number,  a a general  length.  They 
are  not  intended  to  have  the  same  value  every  time  they  are  used.  The  term  “bounded”  is  often  used  in  place  of 
“absolutely  bounded.” 
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This,  however  is  much  too  strong  a condition  in  general.  Let  us  defined'  as  the  set  of  potentials  for  which 
Eqs.  (7)  and  (9)  make  sense;  J>'  is  the  set  in  which  one  must  look  for  the  solutions  of  the  inverse  problem.  In  this 
simple  case,  the  inverse  problem  can  be  formulated  as  follows:  Construct  the  potentials  K(r),  (V  e ?')  that  yieh’ 
the  cross  section  o(0). 

As  in  most  inverse  problems,  the  questions  of  interest  changed  with  time  throughout  the  successive  studies  of 
this  problem.  In  the  first  studies,  authors  tried  to  obtain  an  approximate  solution  of  the  problem.  The  quest  for 
exact  solutions  led  to  the  realization  of  the  nonuniqueness  of  solutions  and  its  consequences,  which  suggested  further 
questions.  There  are  six  main  questions  essential  in  the  inverse  problem  at  fixed  energy. 

1.  Given  o(8),  does  a potential  V exist  in  "P ' that  generates  c at  the  energy  E through  the  Schrodinger 
equation?  Such  a potential  is  called  a solution  of  the  inverse  problem  at  the  energy  E. 

2.  Let  us  assume  a class  w of  functions  (wCJP')  that  contains  a solution  V of  the  inverse  problem  at 
the  energy  E.  Is  V the  unique  solution  in  v? 

Tnc  answer  to  (1)  being  in  general  affirmative  and  the  answer  to  (2)  being  in  gene.i!  negative,  we  are  led  to: 

?.  Give  a method  of  constructing  a solution  of  the  problem. 

4.  Let  us  call  “equivalent”  two  potentials  that  yield  the  same  cross  section.  Give  methods  for  constructing  ail 
the  equivalent  potentials  in  jP' , or  in  a large  enough,  well-defined  subset  o f2P‘. 

5.  Give  an  appraisal  of  the  deviation  from  each  other  of  all  the  equivalent  potentials  in  ~P'  or  in  a large  enough, 
well  defined,  subset  of  V . 

In  (1)  through  (5),  it  is  assumed  that  the  cross  section  was  exactly  known.  The  experimental  scattering  data 
are  necessarily  affected  by  errors.  We  must  find  out  whethf  i the  inversion  procedures  are  stable  with  respect  to  the 
perturbation  generated  by  the  noise  of  the  data: 

6.  Describe  the  evolution  of  the  set  of  equivalent  potentials  when  the  cross  section  is  submitted  to  i;r.dom 
perturbations. 

The  above  are  strictly  concerned  with  the  inverse  problem  at  fixed  energy.  In  a more  general  framework,  an 
interesting  question  would  be*  Let  o(0)  be  given  as  a functi^r:  of  E What  conditions  must  be  fulfilled  for  which 
at  least  one  of  the  solutions  is  a static  potential,  and  how  do  we  obtain  that  solution? 

We  have  formulated  all  our  questions  for  the  simple  problem  of  a spherically  symmetric  spinless  potential. 

With  trivial  modifications,  they  can  be  extended  to  the  various  generalizations  of  the  problem. 

The  historical  development  of  the  subject  has  been  similar  to  that  of  other  inverse  problems.  The  early  studies 
tried  to  solve  questions  (1)  and  (3)  by  trial  and  error,  or  by  approximate  methods.  General  methods  began  to  appear 
in  1959,  but  they  remained  widely  formal  until  a few  years  ago  when  they  were  systematically  developed,  giving 
definite  answers  to  (1),  (2),  and  (3).  The  solutions,  or  attempts  at  solution,  to  (4),  (5),  and  (6)  are  very  recent,  and 
the  subject  is  far  from  being  exhausted.  On  the  whole,  the  evolution  of  the  inverse  scattering  problem  at  fixed  energy 
has  been  very  rapid,  and  about  30  papers  have  been  sufficient  to  achieve  a state  of  solution  equivalent  to  that  requir- 
ing about  100  papers  for  the  inverse  problem  at  fixed  angular  momentum.  One  reason  for  this  rapid  evolution,  of 
course,  is  that  many  tools  could  be  constructed  by  analogy  with  the  inverse  problem  at  fixed  £.  There  are,  however, 
several  tools  introduced  in  the  present  problem,  such  as  the  interpolation  coefficients,  or  th . scattering  structure 
function,  that  were  developed  independently  of  the  inverse  problem  at  fixed  £,  and  their  ■ pplicability  to  that  problem 
would  be  an  interesting  subject  for  sti 

Here  we  follow  the  historical  order  in  the  evolution  of  the  inverse  scattering  problem  at  fixed  enc.gy  Except 
for  trial-and-error  methods,  the  published  studies  gave  separate  solutions  to  the  construction  of  the  scattering  amp- 
litude irom  the  cross  section  and  to  the  construction  of  the  potential  from  the  scattering  amplitude. 
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TRIAL-AND-ERROR  METHOD 


The  trial-and-error  method  certainly  is  the  simplest  way  to  give  an  answer  to  question  (3).  Calculations  of 
cross  sections  are  done  for  a series  of  trial  potentials  until  a satisfactory  fit  is  obtained.  Hence,  the  following  answer 
to  (3)  can  be  obtained:  Here  is  a potential  that  yields  a cross  section  approximately  equal  to  the  experimental  cross 
section. 

Because  of  the  experimental  errors,  three  parameters  related  to  the  depth  of  the  potenual,  its  range,  and  the 
range  of  its  “surface,”  respectively,  prove  to  be  sufficient  to  define  the  class  of  trial  potentials.  The  most  popular 
shape  of  potential  including  these  parameters  seems  to  be  tire  Saxon-Woods  shape: 


(10) 


V(r)  = A 


1 + exp  (r-R) 


where  A is  the  potential  depth,  R its  range,  and  a the  surface  width. 

The  method  can  easily  be  generalized.  It  has  been  used  for  potentials  including  spin-orbit  forces,  tensor  forces, 
imaginary  parts,  and  mort  get  erally  lT  situations  in  which  the  direct  problem  has  been  solved.  Fitting  procedures 
have  been  made  as  reliable  and  efficient  as  possible. 

Any  trial-and *rror  ‘ method”  is  nothing  more  than  an  algorithm  in  which  a series  of  observable  quantities  is 
calculated  from  the  interaction  and  compared  with  the  experimental  result.  Therefore,  the  differences  between  the 
various  procedures  are  concerned  either  with  numerical  analysis  or  'with  computer  tricks.  The  structure  of  the 
method,  its  physical  interest,  and  its  fundamental  limits  are  always  the  same.  Furthermore,  the  only  information 
that  can  be  obtained  by  a trial-and-error  method  for  the  inverse  problem  is  the  partial  answer  to  (3)  quoted  above. 

On  the  other  hand,  the  trial-and-error  methods  are  the  only  ones  in  which  the  existence  of  a static  potential 
consistent  with  the  experimental  results  has  been  studied.  In  nuclear  physics,  where  extensive  studies  have  been 
done,  trial-and-error  methods  have  made  plausible  the  proposal  that  the  nuclear  potential  depends  smoothly  on  the 
energy,  and  since  this  dependence  is  also  predicted  by  the  theory  of  nuclei,  we  have  every  reason  to  believe  it.  Also, 
the  failure  of  trial-and-error  methods  to  yield  a static  potential  to  fit  the  cross  sections  at  various  energies  does  not 
imply  that  such  a potential  does  not  exist.  Results  obtained  by  trial-and-error  method  are  necessarily  biased  by  the 
set  of  trial  potentials  used. 

Textbooks,  such  as  H'xigson  [1963]  contain  good  chapters  on  the  fitting  proceaures  used  to  apply  the  optical 
model  in  nuclear  physics.  Computer  programs  are  available  in  most  computer  libraries.  A classical  example  of  a 
detailed  description  of  a program  of  this  kind  has  been  published  by  Melkanoff  etal.  [ 196  i ] . 

Although  we  have  given  only  a cursory  appraisal  of  trial-and-error  methods,  their  importance  should  not  be 
underestimated.  We  believe,  they  are  relevant  to  the  direct  problem-not  to  the  inverse  problem.  They  have  been  of 
significant  value  in  the  development  of  scattering  studies  in  physics,  and  their  success  has  been  the  best  proof  of  the 
validity  of  the  optical  model  in  nuclear  physics.  Furthermore,  the  earliest  recognition  of  the  fundamental  ambigui- 
ties of  the  problem  can  be  attributed  to  trial-and-error  studies. 
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CONSTRUCTION  OF  THE  SCATTERING  AMPLITUDE  FROM  THE  CROSS  SECTION 


The  problem  of  constructing  the  scattering  amplitude  from  the  exact  knowledge  of  the  differential  cross  section 
at  a given  energy  could  theoretically  be  avoided  by  performing  special  measurements  on  the  scattered  particles. 
Goldberger  et  al.  [1963]  noted  this  possibility  on  the  basis  of  an  idea  proposed  by  Hanbury,  Brown  and  Twiss  [1956] . 
See  also  Twis  et  al  [1957] ; Goldberger  and  Watson  [1964;  1965a, b]  and  Goldberger  et  al.  [1966],  Although  this 
suggestion  may  be  of  use  in  some  cnses,  in  particle  physics  it  is  impossible  to  obtain  the  correlated  counting  rates  of 
two  detectors  with  the  beam  intensities  actually  available.  Furthermore,  it  is  not  obvious  that  by  overcoming  this 
difficulty  we  will  be  able  to  obtain  a solution  of  the  problem.  In  typical  situations,  the  scattering  phase  remains 
deeply  involved  in  the  formulas.  Therefore,  mathematical  ways  of  constructing  the  scattering  amplitude  have  a good 
chance  to  remain  useful  for  many  years.  The  solutions  of  this  first  step  of  the  inverse  problem  are  reviewed  by  New- 
ton in  the  preceding  discussion. 


CONSTRUCTION  OF  THE  POTENTIAL  FROM  THE  PHASE  SHIFTS 


Let  us  assume  that  the  first  step  of  the  problem  has  given  an  infinite  set  of  phase  shifts.  An  analysis  of  the 
second  step  of  the  inverse  problem  must  deal  with  the  six  fundamental  questions  given  in  the  introduction.  The  only 
modification  comes  in  questions  (1 ) and  (6),  which  we  replace  by: 

la.  Given  a sequence  6 = {6 g } of  phase  shifts,  consistent  with  Eqs.  (7)  and  (9),  does  a potential  V exist 
in  ~P'  that  generates  6 at  the  energy  El  Such  a potential  is  called  a solution  of  the  inverse  problem  at 
the  energy  E. 

6a.  Describe  the  evolution  of  the  set  of  equivalent  potentials  when  the  cross  section  is  submitted  to  random 
perturbations. 


JWKB  Method 

Development-  The  method  seems  to  have  been  introduced  by  Firsov  [1953] , and  Wheeler  [1955] . The 
presentation  beiow  is  due  to  Sabatier  [1966a] , who  gave  arguments  for  the  “right  choice”  of  an  interpolation. 
Extension  to  higher  orders  of  the  asymptotic  approximation  of  the  phase  shifts  also  has  been  given  by  Sabatier 
and  rediscovered  by  Vollmer  [1969] , who  made  numerical  calculations  using  the  method  for  practical  inversion 
purposes.  Miller  [1969]  has  discussed  the  method  in  terms  of  its  applications  to  problems  in  chemical  physics. 

Note  that  the  first-order  method  is  equivalent  to  the  inversion  method  in  classical  mechanical  given  by  Keller  et  al. 
[1956]. 

The  first  attempt  toward  a discussion  of  this  problem  used  the  JWKB  approximation,  if  we  assume  the  validity 
of  this  approximation,  the  phase  shift  6g  is  given  by: 


where  the  turning  point  rg  is  the  largest  zero  of 


K u 


K(r)]  -v«+l/2)J|. 


Let  us  now  introduce  the  function 


5-18 


T 


(12) 


tSUmm.  • to 


X(f)  = Ml-K(r))‘/3 


Let  us  also  assume  that  X(r)  is  equal  to  zero  for  r-r0  and  is  a monotonically  increasing  function  of  r for 
r>r0.  Let  r ■ \p(K)  be  the  inverse  function  of  X(r).  From  Eq.  (1 1)  it  follows  that: 


6 


C = 


(13) 


where 


H(\) 


(14) 


Within  the  above  assumptions,  V(r)  can  be  derived  readily  from  H(\).  The  problem  reduces  to  the  determina- 
tion of  H(\)  from  the  through  Eq.  (13).  This  is  straightforward  if  6g  is  known  as  a differentiable  function  of 
(8+  1/2),  say  d>(2  + 1/2),  going  to  zero  more  rapidly  than  (£  + l/2)-1-e  as  8 goes  to  <*>.  Then 


H(K)  = -2 f°°*Xx)(x7  dx 

X 


(15) 


Applications-  Suppose  we  are  given  an  infinite  sequence  (6g)  The  first  a priori  assumption  is  that  the 
function  X(r)  corresponding  to  the  unknown  potential  is  mono  Conically  increasing.  Physically,  this  means  that 
there  must  be  no  orbiting.  Then  we  have  to  decide  what  interpolation  we  have  to  choose.  Now,  from  Eq.  (7)  we 
derive 


exp  [2/6gJ  = 


l+i  jT  /(0)/>g(cos  0)  sin  6 d0 


(16) 


One  should  not  conclude  from  Eq.  (16)  that  the  “natural”  interpolation  of  6^  corresponds  to  the  usual  interpola- 
tion of  Pg(cos  0).  Rather,  an  analysis  of  the  errors  due  to  the  JWKB  approximation  for  an  even  potential  shows 
that  the  best  interpolation  may  correspond  to  an  unusual  interpolation  of  Pg(cos  0)  [Sabatier,  1966a] . The  exist- 
ence of  an  infinity  of  interpolations  shows  that  the  apparent  uniqueness  of  the  inversion  procedure  is  false. 

Actually,  we  see  below  that  the  choice  of  an  interpolation  completely  determines  the  exact  potential.  On  the 
other  hand,  only  smooth  interpolations  are  consistent  with  the  assumptions  of  the  JWKB  approximation,  so  that  it 
is  reasonable  to  use  any  smooth  continuous  fitting  of  the  phase  shifts,  and  to  take  for  granted,  or  to  check  a poste- 
rior, that  the  various  potentials  which  may  be  obtained  are  not  very  different  from  each  other.  This  wav  of  using  the 
above  method  may  be  of  interest  for  a real  potential.  When  it  is  used  for  a complex  potential,  the  experimental 
errors  on  the  strongly  absorbed  exp  [2j'6g]  can  yield  enormous  errors  on  the  potential  [Sabatier,  1966a]  There- 
fore, we  conclude  that  the  method  above  is  not  really  an  inversion  method  but  a fitting  procedure.  It  yields  to  the 
inverse  problem  the  answer  given  by  trial-and-error  methods,  nothing  more.  It  may  be  of  interest  to  save  computer 
time  or  to  study  qualitatively  the  relevance  of  a model. 
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A Special  Approach  for  Superposition  of  Yukawa  Potentials 
Martin  and  Targonski  [1961]  looked  for  a solution  of  the  problem  in  the  class  of  potentials  of  the  form 


V(r)  = r~l 


C(a)exp[-ar]  da 


U7) 


where  n is  positive,  and  C(a) e L , (p,00).  Now  let  t be  the  momentum  transfer  [r  = -2(1  -cos  9)].  Its  physical 
values  belong  to  [-4,0],  Let  T(t)  be  the  scattering  amplitude.  If  K(r)  is  given  by  Eq.  (17),  T(t)  is  analytic  in  the 
complex  r plane,  except  along  a cut  on  the  real  axis  from  t-n 2 to  f = +«o,  and  the  following  dispersion  formula, 
or  a conveniently  subtracted  form,  holds: 


m = dr' 


(18) 


where  2iriD(t)  is  the  discontinuity  across  the  cut.  This  formula  can  also  be  considered  as  an  integral  equation  for 
D(t).  It  enables  one  to  obtain  D(t)  by  extrapolating  the  values  that  T(t)  assumes  in  the  physical  region.  Once  the 
discontinuity  has  been  obtained,  the  potential  can  be  reconstructed.  More  precisely,  Martin  and  Targonski  [1961] 
show  that  in  the  Born  expansion  of  7"(f)>  the  discontinuity  across  the  cut  in  the  region  ft2  < t < (n  + l)1  n2  comes 
from  the  n first  Bom  terms  only.  Let  us  now  introduce  the  expansion 


nr)  = £>+  Vn+] 


(19) 


where 


, /*(8+l)jr 

Vo  = r~‘  I C( a)  exp  [-ar]  da  2 < n 

r"*>  ‘ r"  4”i)K  <**>“>> 


(20) 


note  that  V„+]  does  not  contribute  to  the  discontinuity  for  ji1  < t < (n  + l)2  n2 , while  Vn  does  contribute 

through  T,  only.  Now,  assume  that  C(a)  is  known  from  a-n  to  a*np..  This  yields  Kj , Fj Vn_  j , 

from  which  the  discontinuity  of  7j  ■*••••♦  Tn  in  the  region  n2yt2  < t < (n  + l)2  can  be  derived.  We  can 
the),  obtain  C(a)  in  this  interval  through  the  formula 

2ni r“,/,  C(rua)  * 2niD(t)  - discontinuity  of  (7*a  + •••  + Tn) 

Through  this  iterative  procedure,  we  can  reconstruct  C(a)  from  the  discontinuity  of  T across  the  cut. 
Martin  and  Targonski  [1961]  give  the  necessary  (but  not  necessarily  suLlcieni)  conditions  for  the  scattering  ampli- 
tude to  be  generated  by  a superposition  of  Yukawa  potential..  The  uniqueness  of  the  solution  follows  from  the 
analytic  continuation  and  the  iteration  procedure.  The  method  can  be  easily  extended  to  exchange  forces.  It  can 
also  probably  be  extended  to  potentials  bounded  by  decreasing  exponentials  (Martin,  private  communication). 
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Numerical  computations  are  not  feasible  Pesides,  since  the  method  involves  an  analytic  continuation,  it  is 
generally  unstable  with  respect  to  experimental  or  roundoff  errors  [ Viano,  1969) 


Approaches  Through  a Gelland-Levitan  Procedure 

Regge  [1959]  introduced  an  approach  to  the  scattering  problems  at  fixed  energy  fairly  close  to  the  Geltand- 
Levitui  procedure  introduced  in  rroblems  at  fixed  8.  In  such  a method,  given  two  potentials  VQ  and  V,  one 
looks  for  a “transformation  kernel”  K y ^ (r,  r'),  which  generates  the  wave  function  from  the  wave  function 

i/'g  0 through 


i<iV(r)  - - I Ky  V{r,p)\l/tV°(p)p  2 dp 

Jrj 

Kyjir.r')  = --jr  I p \V(p)  - VQ(p)\  dp 
^ Jo 


(21) 


(22) 


Therefore,  if  VQ  is  a well-known  potential  with  well-known  wave  functions,  the  scattering  problem  reduces  to  the 
determination  of  Ky  ^(r  r).  For  this,  it  is  convenient  to  introduce  an  auxiliary  tool,  the  input  function 
/ y^(r‘ r')-  which  is  a^olution  of  the  partial  differential  equation 


(d?°-d/0)  fyjir.r)  =0 
fyo  V(r,o)  = fvV(o,r)  = 0 


(23) 


and  enables  one  to  obtain  the  transformation  kernel  through  the  “fundamental’'  integral  equation 


KvV(r,r')  = fvV(r,r)  - 
ro  r0 


K yV (r,  p)fy^(p,r) p-2  dp 


(24) 


obtained  by  analogy  with  the  Geifand-Levitan  equation.  Therefore,  the  scattering  problem  is  completely  solved 
when  fy  V(r,  r ) is  known.  A convenient  representation  of  this  machinery  is  given  by  the  triangular  diagram 
[Sabatief!  196<2] . 
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Note  that  Eq.  (24)  is  a Fredholm  equation.  If  r' -+fy^(r,r')  belongs  to  I i(0,r),  the  Fredholm  alternative 
holds.  If  we  keep  only  continuous  solutions  of  Eq.  (23),  the  Values  of  r for  which  the  homogeneous  form  of  Eq. 
(24)  has  a solution  correspond  to  singularities  of  V(r).  At  any  regular  point  of  K(r),  therefore,  Eq.  (24)  has  one 
solution  only.  With  weak  additional  assumptions,  it  is  possible  to  show  that  the  singularities  are  isolated.  It  is  easy 
to  show  from  Eqs.  (23)  and  (24)  that  the  following  equations  hold  at  any  regular  point: 


(z>/-Z>/°)  KyoV(r.r')  = 0 

KvV{r,Q)  = KvV{OS)  = 0 
To  r0 


(25) 


v , 

Two  kinds  of  studies  have  used  fy  (r.r)  as  the  fundamental  tool  for  a solution.  In  the  Regge-Loeffel  method, 

the  questions  (1)  and  (2)  are  well  studied  but  (3),  (4),  and  (5)  are  definitely  neglected;  the  empkaris,  therefore,  is  on 

a general  representation  of  fy  Y (r,  r).  In  the  Newton-Sabatier  method,  question  (3)  is  dominant,  and  (1)  and  (2) 

follow;  the  emphasis,  therefor?,  is  on  representations  of  fyV(r,  r)  that  permit  effective  construction  of  the  potentials. 

ro 


Regge-Loeffel  Method  - Introduced  by  Regge  [1959]  and  thoroughly  studied  by  Loeffel  [1968] , this  method 
makes  use  of  the  above  machinery  with  V0  * 1.  The  unperturbed  partial  wave  equation  thus  reads: 


r2 


il 

a rJ 


il'gW  * 2(8+l)<l'£(r) 


(2b) 


y 

Hence,  4/%°(r)  reduces  to  a power  of  r.  Using  Gi’lfand-Levitao  [1951]  methods,  Loeffel  was  able  to  prove 
that  the  function  f^ir,  r)  corresponding  to  any  potential  of  o has  the  following  representation 

/»*W)  = --(2*)-'  I l{T)(xyf-xn  + 2_J  *k(xy)Vk'm  (27) 

1 

where  the  continuous  functions  y(r)  and  the  sequences  d*  an  t vg  are  unique  for  a given  potential.  Actually, 
these  quantities  appear  in  the  spectrum  of  the  differential  operator  - (d/dx)  [x2  (d/dx)]  - 1/4  + x2  [K(x)  - 1 ] and 
thus  are  termed  the  spectral  data  by  Loeffel.  The  series  in  Eq.  (27)  converges  uniformly  for  (x.y)  in  any  compact 
contained  in  R+X/?+.  The  integral  symbolizes  a unitary  operator  V defined  on  Lj(0,*»)  whose  application  to  a 
function  of  Li(0,°°)  should  take  into  account  the  convergence  in  Lj  of  all  the  integrals.  Using  this  complete 
characterization  of  any  potential  in  a,  it  has  been  possible  to  prove  by  function-theorcuc  methods  the  two  follow- 
ing uniqueness  theorems  [Loeffel,  1968],  where  v is  the  “continuous  variable”  X + 1/2: 

Theorem  l . Let  V\  and  U}  be  in  o Let  cti  and  he  the  corresponding  Jost  functions.  If 
ot 1 0»)  = aj O')  for  Re  v > 0,  then  K,(x)  » Rj(x)  for  almost  all  positive  x. 

Theorem  2.  Let  jg  » exp[2i  5g] , let  o(?  ♦ 1/2)  be  the  interpolation  of  »g  obtained  through  the  Jost 
functions  (the  so-called  “Regge  interpolation”),  and  let  K,  and  K2  be  in  the  clan  &.  If  the  corresponding 
Regge  interpolations  o,  and  a->  satisfy  c,  (v)  - <t}(v)  for  all  v with  Re  v > 0 where  both  are  hoiomor- 
phic,  then  V{(x)  = Fj(x)  for  almost  all  positive  x. 
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The  problem  of  uniqueneti  therefore  reduces  to  the  step  ig  -+  o(B  + 1/2).  At  this  point,  there  can  be  unique- 
ness only  for  particular  classes  of  potentials  that  allow  the  interpolation  to  be  unique.  This  holds  if  the  analytic 
properties  of  the  Jost  functions  enable  one  to  apply  a uniqueness  theorem  such  as  Carlson’s  theorem  or  the  Lagrange- 
Valiron  theorem.  If  a subclass  of  a is  chosen  such  that  the  solution  is  unique,  and  if  the  Regge  interpolation  is 
known,  question  (1 ) can  be  answered  positively  in  certain  cases  [Loeffel,  1968 \Burdet  et  al,  1965]  and  formal 
answers  can  be  given  to  (3).  However,  attempts  to  answer  (5)  and  (6)  are  lacking,  and  no  constructive  method  is 
available.  Besides,  the  stability  of  a method  using  an  interpolation  process  as  an  intermediate  step  is  highly  question- 
able [Viano,  1969]. 

Newton-Sabatier  methods  - Let  us  use  the  machinery  quoted  above,  with  1^-0,  and  following  Newton 
[1962]  let  us  introduce  the  series 


/>, r)  = 22  cg“g(r)Me(r’)  (28) 

0 


where  lcgl<C(l+8)  for  any  8.  Hence,  the  expansion 


OO 

r')  = 2L* c e 1 * fiW“ 

o 


(29) 


follows  for  Kq  (r,  r').  Substituting  Eq.  (29)  in  Eq.  (21),  we  obtain 


v(r)  = Ug(r)-  T^loo»(r)co»«[>o»(r) 
8 


(30) 


where 


igg'fr)  = f t‘g'(fi)ue  (p)p-1  d.o 


(31) 


The  inhnite  system  of  coupled  linear  algebraic  Eq.  (30'  ' equivalent  to  the  integral  Eq.  (24).  Letting  r-^<» 
in  Eq.  (30)  and  using  Eq.  (5)  we  obtain  the  system 


where 


Ag  ei6t 


Cg~AgeiS « 
(28  + 1) 


Af  t 


(6g# 


(32) 


if  8'  - 8 is  odd 


if  8'  - 8 is  even 


(33) 
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1 


ir-'r' 


Considering  separately  the  real  and  the  imaginary  parts  of  Eq.  (32),  we  obtain  the  systems: 

tan  8g  * ^82'flg'0  + t,n  ®8  tan  ®B') 

8' 

I 1 at 

cg  * ag(l  + tan1  6g)  1 1 rr  (1  + tan*  6g)  - > (tan  6g>  - tan  6j) 

L 22+1  o' 


(34) 


(35) 


where 


a g = c gA  g cos  6 q 

In  matrix  notation,  system  in  Eq.  (34)  reads 


(36) 


tan  Ae  = W(1  + R) a 


(37) 


where 


R = M 1 tan  AAf  tan  A 


(38) 


This  ends  the  formal  invasion  procedure.  It  is  clear  tnat  if  M~l  and  (/  + R)~'  can  be  obtained,  a set  of 
coefficients  a g can  be  obtained  from  Eq.  (37).  When  inserted  in  Eq.  (35),  they  yield  a set  of  eg'  that  yield  all  the 
quantities  in  the  problem.  Although  this  method  appears  very  simple,  progress  beyond  these  formal  aspects  has 
been  surprisingly  difficult. 

Using  the  properties  of  an  auxiliary  matrix  N,  Newton  [l%2]  was  able  to  prove  the  existence  of  a matrix 
M~ 1 bis'ded  inverse  of  M and  of  a vector  v such  that 


My  = 0 


(39) 


A study  bv  Redmond  11964]  improved  out  knowledge  of  the  matrix  N,  but  not  until  1966  were  M_l  and 
v explicitly  given,  together  with  the  analogous  quantities  corresponding  to  a class  of  infinite  matrices  [Sabatier, 
19666] . The  result,  for  M is: 


M~l  -pAfp 


(40) 


vsg  “ 
Vig+i  * 0 


(41) 
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where 


4 

r(n  + 1/2) 

I*/,  .) 

n*n  = T 

7T 

F(n+1) 

r-  1 

4 

T(n  + 3/2) 

_r(n+  1) 

1 r 

When  (1  + R)  is  invertible,  the  solution  of  Eq.  (37)  is  therefore 

t = (1  + R)~x  (av  + M~x  tan  Ae) 


(42) 


(43) 


(44) 


where  a is  an  arbitrary  parameter. 

It  has  beer,  possible  [ Sabatier . 19666]  to  prove  that  the  method  is  consistent  for  almost  every  set  of  phase 
shifts  satisfying  the  inequality 


tan  6gl  <C(C+  1) 


-s  -e 


and  that  it  yields,  for  each  a,  a potential  V(r)  bounded  for  positive  r 


V(r)  - C(a-0)r  32  cos  l2/- 


(2' ' 7') 


and  going  to  zero  as 
0(re_1)  r-*”> 


(45) 


(46) 


where  0 depen.  on  the  phase  shifts  only.  Among  these  equivalent  potentials,  we  see  that  one,  and  only  one,  goes 
to  zero  more  rapidly  than  r~rn  and  may  exhibit  a nonoscillating  tail.  If  the  6 g are  chosen  equal  to  zero,  a non- 
trivial class  of ‘‘transparent  potentials”  that  depend  on  a is  obtained. 

We  noted  that  the  method  works  for  almost  every  set  of  phase  shifts.  To  obtain  the  «g,  we  need  (1  + R)~l ; 
on  the  other  hand,  once  obtained,  the  yield  the  phase  shifts  6g  it  the  system  of  Eq.  (32)  has  only  one  solution 

A j e * ® ® , in  both  cases,  the  required  conditions  imply  the  regularity  of  matrices  of  the  form  1 + MX , where  X 
is  a diagonal  matrix  (with  element  equal  to  ic g,  or  to  tan  6g)- 

With  the  inequality  in  Eq.  (45),  the  elements  of  X are  bounded  and  MX  is  a completely  continuous  opeiator. 
Therefore,  ! + MX  is  invertible  unless  the  phase  shifts  belong  to  exceptional  sets  for  which  MX  has  -1  as  an 
eigenvalue. 

An  interesting  consequence  of  the  bounds  required  to  make  the  method  consistent  is  that  the  ciass  of  poten- 
tials that  can  be  obtained  is  relatively  narrow.  For  the  convergence  of  Eq.  (32),  it  is  necessary  that  i Cj  1 < C8 . It 
follows  that  fg{r,  r)  is  an  entire  function  of  r and  r , that  Eq.  (24)  h«  »r.  unique  solution  for  almost  every  r, 
and  that  K(r,r  ) is  a meromorphic  function  [Sabatier,  1966c].  The.efore,  V(r)  is  a meromorphic  function,  with 
poles  of  order  2,  all  bounded  away  from  r * 0.  Another  limitation  on  the  class  of  potentials  comes  from 
the  fact  that  C in  Eq.  (28)  can  be  an  integer  only.  Even  potentials  cannot  be  obtained  in  this  way.  For  these 
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reasons,  Sabatier  [1967a]  gave  a generalization  of  the  method  by  allowing  8 in  Eq.  (28)  to  take  any  real  values 
larger  than  -1/2.  When  this  set  S of  values  contain  the  integers  and  all  the  eg  are  absolutely  bounded,  the  inverse 
problem  can  be  solved  and  yields  a much  larger  class  of  equivalent  potentials.  It  can  also  be  solved,  at  least  when  all 
the  tanfig  are  small  enough,  if  £ is  allowed  to  take  half-integer  values  only  [Sabatier,  1967a]. 

The  bounds  of  the  Cg  also  yield  interesting  representations  of  the  Jost  function  in  the  8 complex  plane.  It  is 
convenient  to  use  the  notation  p for  8+  1/2,  yu{V,  o)  for  c„_,,2 , -v,.  for  !ru_,/2.  <PU  for  3 . so 
that  tf(r.r')  reads 


(47) 


Let  us  assume  that  the  y^  are  absolutely  equibounded.  The  Jost  functions  are  then  given  by  [Sabatier,  1966c; 
1967c] 


/,  2 (X)  = exp 


an  [(X  -p)(rr/2)j 
X1  -u1 


7 M(K, (?)/,.;»  Or) 


(48) 


which  can  be  interpreted  as  a special  dispersion  fo.Tnula  [Newton,  1967] . When  Sa  contains  all  the  nun.bers  of  the 
form  n/k  ( k , fixed  integer,  n = 1 , 2, . . .),  it  can  be  interpreted  as  a Lagrange-Valiron  interpolation  formula  [Sabatier, 
1967b]  or  a denved  form  of  such  an  interpolation.  Other  interpolation  formulas  can  be  obtained  for  the  v-a.e  func- 
tions which  are  valid  even  for  y^  strongly  increasing  a This  interpretation  le<t  us  to  the  additive  formula 


7MW  V)  = TpW  Vq)  - yM(K  Vq) 


(49) 


The  interpolation  formulas  for  the  wave  functions  make  it  possible  to  derive  the  integral  Eq.  (24).  Therefore, 
as  pointed  out  and  illustrated  by  Sabatier  [1968] , generalizations  of  the  interpolation  formulas  may  yield  generaliza- 
tions of  the  Regge-Newton  procedure. 

The  Newt on-Saba tier  method  can  be  used  to  construct  exactly  solvable  potentials  [Newton,  1962;  Sabatier, 
1967a] . In  some  cases,  it  is  even  possible  to  obtain  a closed  form  of  the  scattering  amplitude  [Sabatier,  1966c;  Cox 
and  Thompson,  1968] . In  addition,  throughout  this  method,  the  first  moment 

/~rK(r)dr 


can  be  related  to  the  asymptotic  behavior  ol  the  Cg  [Sabatier,  1967c] ; in  particular,  if  they  vanish  for  8 > L < **, 
this  moment  is  zero  [Newton,  1967] . Some  authors  have  emphasized  this  point  [Cox  and  Thompson,  1970a; 
Coudray  and  Cot,  1971]  and  have  given  » method  [Cox  and  Thompson,  197Qa,b]  to  avoid  what  they  consider  an 
essential  complication  for  numerical  calculations.  In  our  opinion,  this  point  is  certainly  over-emphasized  (see  below 
for  numerical  calculations).  Its  main  effect  is  to  produce  oscillations  in  the  tail  of  the  potential. 
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Generalizations  of  the  Regge-Newton  Proce^u.t 

Relatively  direct  generalizations  of  the  Regge-Newton  procedure,  in  the  framework  of  generalized  translation 
operators,  include  those  to  Coulomb  potentials  [Coudray  am,  Coz,  1970]  and  relativistic  cases  [Coudray  and  Coz, 
1971],  Generalizations  to  spin-orbit  potentials  [Sabatier,  1968]  and  tenser  forces  [Hooshyear,  1970]  necessitate 
much  deeper  transformations  of  the  machinery.  All  these  studies  are  still  in  a “formal”  state,*  and  much  work  is 
needed  to  clarify  their  details. 


COMPLETE  SOLUTION  OF  THE  INVERSE  PROBLEM  AT  FIXED  ENERGY 


The  methods  discussed  in  the  preceding  section  yield  answers  on’.y  to  questions  (1),  (2),  and  (3).  In  the 
“Regge-Loeffel”  method,  the  answers  to  (1)  and  (2)  are  very  general,  but  there  is  almost  no  answer  to  (3).  In  the 
methods  using  the  coefficients  c^,  the  answer  to  (3)  is  confined  to  special  classes  of  potentials,  which  we  generally 
denote  as  “the  class  C.”  In  these  methods,  if  we  allow  p to  have  complex  values,  taking  care  of  the  reality  of 
»’(r),  it  is  certainly  possible  to  obtain  expansions  of  Eq.  (27)  that  are  complete  in  a large  space  of  functions.  At 
the  same  time,  however,  we  preclude  the  use  of  the  inverse-matrix  procedures;  furthermore,  inclusion  of  the  integral 
values  of  p would  give  redundant  expansions.  Sabatier  [1970, 1971]  has  proposed  the  following  method  of  solu- 
tion to  the  inverse  problem,  which  allow  answers  to  all  six  questions  (1)  through  (6). 

Let  ff  be  the  class  of  functions  that  are  absolutely  bounded  by  CV~1  + e and  Cr-3-e,  and  £ the  class  of 
potentials  V(r)  such  that  K(r),  r V" (r),  and  r2K"(r)  belong  to  & . It  has  been  proved  that  for  any  potential  of 
S,  the  transformation  kernel  K(r,  r)  and  its  derivative  with  respect  to  r can  be  written  [Sabatier,  1970] 


^1  -i— ^K(r,r')  = - ir'J^(r')e\p  [ir]  + KN(r, r') 

where  K^f(r,r)  is  a so-called  “negligible”  function  with  respect  to  r,  namely: 

KN(r,r')^0  r-+<* 

I Kj^r,  r)  1 [r  (1  + r')]  1 dr'-*  0 


(SO) 


f -►  oo 


(51) 


Using  Eqs.  (21),  (SO),  and  (51),  it  is  possible  to  obtain  the  remarkable  formula 

jf  J{(p)us(p)p-1  dp  = exp  i C y)[  1 - A g exp  (i  6 g)]  (52) 

The  scattering  structure  function  J^(p)  can  be  related  bijectively  to  V(r).  The  important  step  in  the  inverse  prob- 
lem therefore  resides  in  obtaining^ (p)  from  the  6g  through  Eq.  (52).  This  is  a kind  of  generalized  moment 
problem  and  could  be  treated  in  this  way  [Sabatier,  in  preparation] . It  is  also  possible  to  use  an  asymptotic  form 
of  a Gelland-Levitan  formulas  [Sabatier,  1971  ] . For  potentials  of  5 , the  function  fo^ir,  r')  and  its  derivative 
can  be  expanded: 
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(>  -«  exp  [irj  +/yV(r,rl) 


(53) 


whew  /^(r,  r')  is  negligible  with  respect  to  r . From  Eqs.  (.24),  (50),  and  (53),  we  obtain 


-^(O  =3'{r)~  Jf  dp 


(54) 


where 

^(p.r)  - (rr')~ly(r,r) 


(55) 


Equation  (54)  supersedes  the  Regge-Newton  Eq.  (24)  in  all  situations  where  r is  largest  — that  is,  in  scattering 
studies.  Again,  for  potentials  of  £ , it  is  possible  to  prove  that  [Sabatier,  1970,1971] 


(r,  r')  = 4 l.i.m. 


sin  w uF(u)  d u 


(56) 


where 


w = [(r-r'/,  + 4rr'n2],/J 


(57) 


F(u)  * -(271)-*  V0(u~l  - l)tfl  -rO  + tKu) 


(58) 


is  the  first  moment  of  the  potential,  O'  the  Heaviside  function,  ip(u)  a function  of  and  l.i.m.  means 

limit  in  L The  premise  of  our  method  is  that  F(u),  which  is  the  sine  Fourier  transform  of  r*(  r,r ),  can 

be  split  into  two  parts. 


F(u ) = F(u)  0\\  -u)  + F{u) &(u  - 1) 


(59) 


When  inserted  in  Eq.  (56),  the  t no  parts  of  the  right-hand  side  of  Eq.  (59)  give  rise  to  two  functions:  3*  ^(r,  r), 
whose  spectrum  is  confined  to  (0,1);  and  3E{r,  r),  whose  spectrum  is  confined  to  (1,«>).  These  two  functions 
are  called  the  internal  and  external  projections  of  3*  (r,  r),  respectively,  and  both  can  be  expanded  as  in  Eq.  (53)  to  yield 
the  internal  and  the  external  projections  of  & (r')>  say,  V)  and  3E(r).  For  convenience,  we  also  define  the 
slightly  different  functions 


3'\r,r‘)  * 3\r,  r)  + 6 (rr')  1 sin  r sin  r 
3E(r,  r)  = r)  - 6 (rr')~l  sin  r sin  r 


(56) 
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and  the  corresponding  quantities  3^(r)  and  3'E(r).  We  are  now  in  position  to  solve  Eq.  (54).  For  this  purpose 
we  define  ]r[  J(r)  and  Ji.E(r)  by 


J</(r)  * ?7(r)  - / >t0>)?7(p,r)dp  (57) 

o 

HE(r)  = 3L(r)  - Jf  J((fi)3E(ft.r)  dp  (58) 

Clearly,  Eq.  (54)  is  satisfied  if  and  only  if 

J(£(r)  = J((r)  (59) 

We  now  choose  5^(r,  r')  arbitrarily  in  a class  of  functions  3*  to  which  the  functions  &E{r,  r)  correspond- 
ing to  potentials  of  $ do  belong.  This  class  of  functions  is  thoroughly  studied  in  Sabatier  [1971  j . Let  ff>£(r.  r) 
be  the  resolvent: 


(f.E(r,  r')  = 3E(r,  r')  - jf  °°  £E(r,  p)  fE{p,  r)  dp 


(60) 


Using  Eq.  (60),  we  can  write  Eq.  (58)  as 


JtE(r)  = >{E(r)  - So°°y\!ip)  tfiP.  r)  dp 


(61) 


where  Xf (r)  can  be  derived  in  a straightforward  manner  from  3 £{r,  r)  and  fyE(r,  r),  and  therefore  can  be  con- 
sidered a known  quantity  as  soon  as  &E(f,  r)  has  been  chosen  in  3f . 

By  inserting  Eqs.  (61)  and  (59)  into  Eq.  (57)  w«  new  can  obtain  an  equation  reducible  to  an  infinite  system  of 
linear  algebraic  equations  that  can  be  exactly  solved!  Since 


^7(r,  *■')  * 4 /*  w“l  sin  w F(u)u  du 


(62) 


we  can  write 


^(r.r')  * ( rr')”1  cg«g  (r)u8(r') 

0 


(63) 
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whcre 


c g = 4(28+1)  I F(u)J>g(l-2if,)u  d u (64) 


The  coefficients  c g are  absolutely  equibounded  and  go  to  a constant  as  8 -*■ «. 

Similar  formulas  can  be  obtained  for  r),  and  3^(r ).  Inserting  them  in  Eq.  (57)  and  taking  into 
account  the  information  contained  in  Eq.  (52',  we  obtain  the  remarkable  formula 


OO 


which  is  independent  of  the  choice  of  J?E(  r,  r). 

Now  inserting  Eqs.  (65),  (59),  and  (61)  into  Eq.  (52),  we  obtain  the  infinite  system  of  equations 


i4g  exp 

i(V8f)]+flfiexp[i-  (a8'e7)] 

= exp(-«8-^j-  Cp'Lpp'Ap'  exp|i^c--8'  -j)J  + ^ expji  - 8'  yjj 

(66) 

where  we  set 

^8_8y)]  = 1 ^-E^u^p  ‘ ^ 

(67) 

gpp'  * jf  Jf  ^.E(x,y)up(x)up>{y)x~l  y~  dx  dy 

(68) 

The  quantities  Bp,  og)  gg£>,  are  known  after  the  choice  of  3E{r,  r).  (Actually  ggj>  can  be  given  an  explicit 
form  in  terms  of  Bp  and  og).  If  rjp  stands  for: 


+?8  = Bp  (sin  ap  ~ cos  og  tan  5g) 


(69) 


It  is  possible  to  show  that  vanishes  more  rapidly  than  B-1  e as  £-*•«>,  and  to  reduce  the  system  to  the  form 

(t?  + tan  A)e  = Af(l  + S)  a (70) 
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where  <jg  = cgAgCOsfig,  and  S is  a conveniently  defined  matnx.  As  in  the  Newton-Sabatier  method,  it  is  possible 
to  show  that  1 + S is  invertible  for  almost  every  set  of  phase  shifts.  Since  5 depends  on  the  arbitrary  function 
r')  it  is  possible  to  choose  it  in  such  a way  that  (1  + S)~*  exist  and  that  the  Cg  and  the  function!?^/-,  r) 
again  give  the  6g. 

Therefore,  the  method  just  described  gives  an  answer  to  question  (1)  in  a class  of  potentials  that  contains  £ . 

It  gives  a negative  answer  to  (2),  and  the  solution  depends  on  an  arbitrary  function  of  3 . It  gives  a positive  answer  to 
(3)  in  a class  of  potentials  that  contains  £ . Since  C is  hardly  smaller  than  the  class  & of  potentials  in  which 
practically  all  the  results  of  potential  scattering  have  been  obtained,  these  results  are  already  useful.  Moreover,  the 
method  gives  a plain  answer  to  (4)  in  the  class  £ . Still  more  interesting,  it  shows  that  the  deviation  of  all  potentials 
from  each  other  can  be  analyzed  in  terms  of  the  values  of  F(u)  between  u = 1 and  u ~ 00 . If  we  choose  inside 
a class  of  potentials  characterized,  for  instance,  by  given  bounds  on  the  derivatives,  the  importance  of  F(u) 
between  1 and  » is  smaller  for  smaller  bounds,  and  in  any  case  decreases  and  goes  to  zero  as  E -*  <*>.  Therefore,  the 
deviation  of  equivalent  potentials  from  each  other  is  smaller  when  they  are  more  “smooth,”  and  in  any  case  becomes 
smaller  and  smaller  as  E increases.  These  results  have  been  sketched  and  checked  through  numerical  computations 
by  Sabatier  and  Quyen-Van-Phu  [1970] . Except  for  a preliminary  attempt  by  Sabatier  [1967c] , the  last  reference 
is  the  only  available  study  on  the  deviation  of  equivalent  potentials  from  each  other.  It  is  also  the  only  available 
answer  to  question  (5),  and  it  paves  the  way  for  practical  uses  of  the  inversion  proced  ir*. 

Detailed  answers  to  question  (6)  are  still  lacking.  A general  answer  can  however  be  given  using  the  method 
above  Since  this  method  is  constructive,  it  is  easy  to  check  the  continuity  of  all  of  its  steps,  in  convenient  normed 
spaces  and  hence  to  give,  in  general,  an  affirmative  answer  to  question  (6),  or.ee  F(u)  has  been  chosen  for 
u € (I,l°). 
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DISCUSSION 


Calogero:  Within  a certain  class  of  potentials,  Martin  is  able  to  solve  all  the  problems  you  have  mentioned.  He  is 

able  to  prove  existence,  prove  uniqueness,  etc. 

Moses:  You  don't  use  analytic  continuation  in  fi.  Have  you  tried  using  this  in  the  one-dimensional  problem? 

This  has  application  to  electromagnetic  theory. 

Sabatier:  There  is  a difference  between  this  problem  and  the  one  at  fixed  8. 


¥ 


u 

& 


3r 

I 
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EXPUCIT  EXPRESSIONS  OF  THE  POTENTIAL  AND  ITS  DERIVATIVES 
AT  THE  ORIGIN  IN  TERMS  OF  THE  SCATTERING  DATA 

F.  Calogero 

Istituto  di  Fisica  dell'  University  di  Roma 
Istituto  Nazionale  di  Fisica  Nucleare,  Sezione  di  Roma.  Rome,  Italy 

ABSTRACT  3"  1 1 6 1 ^ 

The  framework  of  this  review  paper  is  the  quantum  mechanica’.  theory  of  scattering  of  a particle  by  a 
spherically  symmetrical  potential.  As  in  the  inverse  scattering  problem,  the  input  of  the  calculation  is  the 
scattering  and  bound-state  data,  and  the  output  is  dat:>  on  the  potential.  The  results  discussed  are  explicit 
expressions  for  the  values  of  the  potential  and  its  derivatives  at  the  origin  in  terms  of  the  scattering  and  bound- 
state  data.  Various  methods  to  obtain  these  results  are  outlined.  The  presentation  is  aimed  at  introducing  these 
various  approaches;  the  simplest  scattering  problem  (nonrelativistic  5-wave  scattering  on  a holomorphic  poten- 
tial without  bound  states)  is  used  as  the  basis  for  discussion,  and  technicalities  are  omitted  whenever  possible 
without  loss  of  clarity.  A complete  compilation  is  given  of  the  results  obtained  to  date  in  this  field,  including 
the  treatment  of  higher  partial  waves  and  the  Klein-Gordon  and  Dirac  equations. 

INTRODUCTION 


The  study  of  the  inverse  problem  in  scattering  theory  consists  in  the  development  of  mathematical  tech- 
niques to  obtain  information  on  the  cause  of  (he  scattering  from  the  parameters  that  are  measured  in  a scatter- 
ing experiment.  In  the  framework  of  quantum  mechanics,  major  attention  has  been  given  to  the  case  of  a non- 
relativistic particle  interacting,  via  a potential,  with  a scattering  center;  or,  equivalently,  that  of  two  particles 
interacting  through  a potential  depending  on  their  relative  distance,  a problem  that  reduces  to  the  previous  one 
after  the  (trivial)  center-of-mass  motion  has  been  separated  out.  Even  for  this  specific  case,  several  different 
“inverse  problems”  can  be  stated,  depending  on  the  input  information  taken  as  the  starting  point  of  the  calcu- 
lation. This  paper  focuses  on  the  problem  in  which  the  input  information  is  one  scattering  phase  shift  (assumed 
known  for  all  energies)  and  the  parameters  of  the  corresponding  bound  states. 

This  inverse  scattering  problem  has  been  solved  by  1.  M.  Gelhrnd  and  B.  M.  Levitan  and  by  V.  A.  Marchenko; 
a detailed  account  of  it  is  given  by  Newton  in  the  opening  paper  of  this  chapter.  The  assertion  that  this  problem 
has  been  solved  means  that  existence  and  uniqueness  properties  have  been  stated,  and  that  procedures  for  the 
reconstruction  of  the  potential  function  from  the  input  data  have  been  given.  These  procedures  involve  quadra- 
tures and  the  solution  of  an  integral  Fredholm-type  equation. 

Ihe  developments  reviewed  in  this  paper  constitute  an  attempt  to  bypass  the  practical  difficulty  implied 
by  the  need  to  solve  an  integral  equation;  the  goal  is  to  obtain  informa  tion  on  the  potential  in  the  form  of 
explicit  expressions  involving  the  scattering  and  bound-state  data.  Specifically,  attention  concentrates  on  the 
values  of  the  potential  and  of  its  derivatives  at  the  origin,  and  solicit  closed  expressions  for  these  quantities  are 
obtained  in  terms  of  integrals  and  sums  over  the  scattering  and  bound-state  data. 

To  obtain  these  results,  different  routes  may  be  followed.  In  keeping  with  the  multidisciplinary  character 
of  this  chapter,  the  emphasis  here  is  on  the  illustration  of  various  possibilities  that  might  be  suggestive  of 
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developments  in  other  fields,  rather  than  on  the  presentation  of  the  most  general  and  rigorous  picture.  We  therefore 
consider  the  simplest  possible  problem  (5-wxve  scattering  on  a holomorphic  potential  without  bound  states),  omitting 
technical  details  whenever  this  is  possible  without  loss  of  clarity.  We  also  review  all  the  results  that  this  line  of 
research  has  yielded  up  to  now,  referring  for  the  corresponding  proofs  and  more  detailed  presentation  to  the  original 
papers. 

The  line  of  research  covered  in  this  review  originates  from  a result  obtained  simultaneously  and  independently 
by  Newton  [1956]  and  Faddeev  [1957]  - a closed  expression  for  the  value  of  the  potential  at  the  origin  as  a sum 
over  the  8-wave  bound-state  energies  plus  an  integral  over  the  E-wave  scattering  phase  shift,  considered  as  a function 
of  energy.  In  a sense,  Levinson’s  theorem  [Levinson,  1949]  was  already  a first  instance  of  this  kind  of  result.  Sub- 
sequent developments  are  due  to  Buslaev  and  Faddeev  [1960] , Buslaev  [1962]  ,PercivaI  [1962] , Percival  and  Roberts 
[1963] , Roberts  [1963;  1964b] . These  authors,  employing  different  approaches,  obtained  results  that  implied  the 
possibility  of  obtaining  all  the  derivatives  of  the  potential  at  the  origin  in  terms  of  one  scattering  phase  shift  and  the 
corresponding  bound-state  energies,  provided  the  potential  is  an  even  function  of  r (so  that  all  its  derivatives  of  odd 
order  vanish).  However  the  conditions  to  be  satisfied  by  phase  shifts  for  the  potential  function  to  be  even  could  not 
be  obtained.  The  solution  of  the  problem  for  general  potentials  and  S waves  was  given  by  Calogero  and  Degasperis 
[1968] ; this  work  also  yields  the  conditions  that  the  5-wave  phase  shift  must  satisfy  for  the  corresponding  potential 
to  be  even.  The  extension  of  these  results  to  the  cases  of  the  Klein-Gordon  and  Dirac  equations  are  due  to  Degasperis 
[1970]  and  to  Corbella  [1970] , respectively,  and  the  treatment  of  higher  partial  waves  is  due  to  Corbella  [1971], 

The  techniques  discussed  here  produce  interesting  results  other  than  those  specifically  cited  and  can  be  applied 
in  a more  general  framework  than  that  considered  in  this  paper.  Buslaev  [1966] , for  example,  treats  the  case  of 
scattering  in  the  three-dimensional  context,  without  performing  a partial-wave  expansion,  and  elucidates  the  explicit 
relationships  between  the  scattering  amplitude  and  the  potential  that  are  implied  by  the  trace  method.  Buslaev’s  paper 
provides  a complete  review  up  to  1966,  including  the  numerous  important  contributions  (not  mentioned  here)  of  the 
Russian  school  of  mathematical  physics  and  functional  analysis;  it  is  written  in  accord  with  standards  of  mathematical 
rigor  that  are  considerably  more  stringent  than  those  adopted  in  the  present  paper. 


POTENTIAL  SCATTERING 


The  stationary  Schrodinger  equation  in  three  dimensional  space  reads 


[-A+  K(r)]  Hr)  = £*(r) 


0) 


with  units  chosen  such  that  h2/(2m)  = 1 . We  assume  hereafter  that  the  potential  V(r)  is  spherically  symmetrical,  that 
it  vanishes  at  infinity  faster  than  r~2 , 


lim  [r2  K(r)]  = 0 (2) 


that  it  is  fmite  valued  (for  nonnegative  values  of  r),  and  that  it  is  an  entire  function  of  r,  so  that  the  expansion 


n=0 


0) 
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is  convergent  for  all  values  of  r.  The  quantity  Vfj  is  clearly  the  -ith  derivative  of  the  potential,  evaluated  at  the  origin. 
This  last  assumption  is  certainly  much  more  stringent  than  is  required  for  the  validity  of  all  the  results  given  below. 
Presumably  it  would  be  sufficient  to  assume  that  V(r)  and  all  its  derivatives  are  finite-valued  for  real  nonnegative  r. 

The  scattering  solution  of  the  stationary  Schrodinger  equation  is  characterized  by  the  asymptotic  boundary 
condition 


-►  exp(ifcr) 

const exp(i  k~r  ) + f{k, 6) 

* r-»*  r 


(4) 


with 


k2  = E 


(5) 


and 


cos  ? 


t6) 


Thus  this  solution  contains  asymptotically  a plane  wave  exp  (ik-r ),  corresponding  to  the  incoming  beam,  and  an 
outgoing  scattered  wave,  whose  angular  distribution  is  characterized  by  the  “scattering  amplitude”  f ( k , 6).  The 
differential  (elastic)  scattering  section  o\E,  0)  — the  measurable  quantity  — is  related  to  the  scattering  ampli- 
tude by 


a{E,  8)  = l/(M)P 


(7) 


The  cylindrical  symmetry  of  the  problem  suggests  the  introduction  of  the  “radial  wave  functions”  <t>%(k,  r) 
through  the  position 


V“VnP8(cns  0)0fi(*,r) 

=2_j  — ; — — 

i--0 


(8) 


where  the  Pg  are  Legendre  polynomials.  The  Schrodinger  Eq.  (1)  then  yields  the  “radial  Schrodinger  equation” 


*8  (*,  r)  * [k2  - 8 (?  + 1 > r~2  - K(r) ] *8  (k,  r)  = 0 


(9) 


with  the  boundary  condition 


0)  = 0 


(10) 


5-35 


In  Eq.  (9)  and  the  following,  primes  indicate  differentiation  with  respect  to  the  last  argument.  The  hypothesis  on 
the  potential  introduced  above  ensures  that  this  boundary  condition  characterizes  uniquely,  up  to  a multiplicative 
constant,  a solution  of  Eq.  (9).  They  also  imply  for  the  asymptotic  behavior  of  the  radial  wave  functions  r) 
the  form 


4>o(k,r) 

* f -►  oo 


const  X sm 


r 


kr-t  — 
2 


01) 


The  quantities  called  scattering  phase  shifts , are  related  to  the  scattering  amplitude  by  the  equation 


f(k,0)  = k 1 ^ (2C  +1)  exp [ii?e(Ar)]  sin  Tjg(fc)  /tycosfl)  U?) 

8=0 

which  follows  from  Eq.  (11),  (8),  and  (4).  Thus,  knowledge  of  all  the  scattering  phase  shifts,  is  tantamount  to 
knowledge  of  the  scattering  amplitude  of  the  scattering  amplitude  if,  and  vice  versa,  since  obviously 


exp[i'rjs(*)|  sin  i?g(fc) 


k 

1 


sin  0 /*g( cos  6)f{k,6) 


(13) 


Note  that  in  the  sum  of  Eq.  (12)  only  the  terms  with 


8 < L =*  kr 


(14) 


where  F is  a measure  of  the  range  of  the  potential,  contribute  appreciably ; in  fact  for  8 » L the  phase  shifts 
t)  g(fc)  become  negligibly  small.  Thus,  at  low  energy  only  the  phase  shift  with  8 = 0-  the  S-wave  phase  shift  - is 
important,  and  of  course  the  scattering  becomes  isotropic. 

If  the  potential  has  a sufficiently  strong  attractive  (negative)  part,  the  radial  Schrodinger  equations  (9)  may 
have  normalizable  solutions  for  some  discrete  nonpositive  values  of  the  energy  E-k1.  The  condition*  on  the 
potential  that  we  have  assumed  are  sufficient  to  ensure  that  at  most  a finite  number  of  such  solutions  can  exist.  They 
correspond  to  “bound  states."  Every  bound  state  is  characterized  by  its  (negative)  energy  E^,  and  by  the  “nomali- 
zation  constant"  defined  by 

C8n= 

where  0g(/Tgn.r)  is  the  bound-state  wave  function,  normalized  so  that 

A|77+r]’  l<2,tl,"r' 


(15) 


(16) 
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A sufficient  conditon  to  ensure  that  no  bound  states  be  present  is  the  inequality 


drr  F(r)0[-K(r)] 


< 1 


(17) 


where  6(x)  is  the  usual  step  function, 


0(x)  * 1 x > 0 

0(x)  = 0 x < 0 


(18) 


The  inverse  scattering  problem  consists  in  the  determination  of  the  potential  V(r)  from  the  phase  shifts 
r)[(k)  (considered  as  a known  function  of  k),  and  from  the  energies  E\n  and  the  normalization  constants  C/„, 
for  a particular  value  of  l.  These  parameters  are,  in  fact,  sufficient  to  determine  uniquely  the  potential,  as  described 
in  detail  by  Newton  at  the  beginning  of  this  chapter. 

In  the  rest  of  this  section,  and  in  the  following  three,  we  consider  the  case  of  S waves  (/  = 0),  omitting 
systematically  the  subscript  zero.  We  also  assume,  for  simplicity,  that  no  bound  states  are  present  (except  in  the 
discussion  of  the  method  of  operator  traces). 

In  addition  to  the  radial  Schrodinger  wave  function  characterized  by  the  boundary  conditions  of  Eq.  ( 1 0),  it 
is  expedient  to  introduce  the  lost  solution  f(k,  r)  of  the  radial  Schrodinger  Eq.  (9)  (with  / = 0),  characterized  by 
the  asymptotic  boundary  condition 


lim  [/(*,r)exp(i!Ar)]  = 1 (19) 

r oo 


Its  value  at  the  origin  is  the  Jost  function 


f(k)  * /(*, 0) 


(20) 


'Hie  phase  of  this  function  is  just  the  scattering  phase  shift 


/(*)  * !/(*)! exp  [itj(*)l 


(21) 


Its  modulus  can  be  expressed  through  the  phase  shift  by 


where 


'/(*)!  * ex.p[A(*)] 


(22) 


(23) 
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and  P indicates  the  principal  value  of  the  integral.  Of  coune,  Eqs.  (21)  through  (23)  are  written  for  real  k . The 
inverse  square  of  the  modulus  of  the  Jost  function,  or  more  precisely  the  quantity  (1  - l/(fc)l~i ),  is  termed  the 
spectral  function , and  it  will  play  an  important  role  in  the  following.  Another  important  auxiliary  function  is 


g{k)  * ik  + — In  f(k,r) 
dr 


r = 0 


(24) 


For  real  k , the  imaginary  part  of  this  function  is  simply  related  to  the  spectral  function  by 

lmg{k)  « k 11  - l/(A)  I"1]  (25) 

which  follows  from  the  definition  of  the  Jost  function  through  the  boundary  condition  in  Eq.  (19),  implying  for  real 
k and  r 

/(-*,r)  - [/(*,r)]*  (26) 

and  from  the  Wronskian  relation 


f(k,r)f(-k,r)  f{k,r)f'(-k,r)  2 ik  (27) 

The  definition  of  /(*,  r)  moreover  implies  that  this  is  a holomorphic  function  of  k in  the  half  plane  Imk<  0 , 
and  that  the  only  zeros  of  f (k)  that  might  occur  in  this  half  plane  are  located  on  the  imaginary  axis  [as  is  easily 
proved  from  the  differential  Eq  (9)  and  the  boundary  condition  in  Eq.  (19)1  and  correspond  to  bound  states. 

Indeed  if 


f(k,r)\r  m o */(*)  ■ 0 for  k * -ip,  p > 0 then  f(k,r) 

satisfies  the  boundary  condition  of  Eq.  (10)  characterizing  the  radial  wave  function,  and  at  the  same  time  the  boundary 
condition  in  Eq.  (19)  implies  that  it  vanishes  asymptotically  proportionally  to  exp  (-pr);  thus,  f(-ip,  r ) coincides, 
up  to  a multiplicative  constant,  with  the  normalizable  wive  function  r)  of  a bound  state,  with  E * -p2 . Thus 
if  no  bound  states  are  present,  g(k)  is  also  holomorphic  in  the  lower  half  of  the  complex  k plane,  and  this  (together 
with  its  asymptotic  properties  discussed  below)  implies  that  for  real  k its  real  part  can  be  expressed  as  an  integral 
over  its  imaginary  part,  namely 


Keg(k) 


(28) 


To  obtain  this  equation  we  have  also  used  the  reflection  property 
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Img(-k)  ■ - lmg(k ) 


(29) 


that  follows  from  Eq.  (25)  and  from 


!/(-*)!  « !/(*)! 


This  hst  equation  is  implied  by  Eqs.  (26)  and  (21)  together  with 


!}(-*)  “ -!}(&,! 


As  for  the  reflection  property  of  the  real  part  of  g(fc),  clearly  Eq.  (28)  implies 


Reg(-k)  = Reg(k) 


Equations  (28)  through  (32)  are,  of  course,  for  real  k . 

The  convergence  of  the  integrals  in  Eqs.  (23)  and  (28)  is  implied  by  the  asymptotic  vanishing  of  *h<;  phase 
shift  ri(k)  and  of  the  spectral  function  1 - l/(fc)  I . In  fact,  the  properties  that  we  have  assumed  for  the 
potential  imply  that  the  phase  shift  rj(fc)  admits  the  asymptotic  expansion  [Verde,  1955] 


i y 

)(*)  = ^ an  Jr*""1  + 0{k  <N~3) 


and  this  result  with  Eqs.  (22)  and  (23)  implies  for  A(k)  the  asymptotic  expansion 


d„kr*n  + 0(k-*-V-*) 


and  therefore  for  I /(it)  I the  asymptotic  behavior 


!/(*)!  ■»  1 + (Hit-1) 


A more  detailed  discussion  of  these  asymptotic  expansions  is  given  in  a later  sectio. 


THE  METHOD  OF  FUNCTIONAL  INTEGRATION 


On  the  basis  of  the  Gelhnd-Levitan  solution  of  the  inverse  scattering  problem,  Newton  [1956, 1966]  o'  ^ned 
the  following  expression  for  the  functional  derivatve  if  the  potential  K(r)  with  respect  to  the  scattering  phase  shift 

V(k): 


where 


6KW 

6rt(k) 


(36) 


4 d 

= — G(k\r ) 

tr  dr 


G(k2,r)  = ~(2k)  1 In: {exp  (2it}(*)]  /*  (-*,/■)} 


(37) 


and  /(*,  r)  is  the  Jost  solution  of  the  radial  Schrodinger  equation.  In  writing  these  equations,  and  always  in  this 
section,  we  assume  k to  be  real.  The  fact  that  G(k2,  r)  is  a function  of  k2  rather  than  A;  is  implied  by  the 
symmetry  and  reality  properties  of  ij(k)  and  f(k,r)  (Eqs.  (26), (31),  and(2I)]. 

The  validity  of  Eq.  (36)  is  elucidated  in  the  following,  when  we  tackle  the  problem  of  its  integration.  For  the 
moment  we  limit  ourselves  to  specify  that  whenever  we  write  a functional  derivative,  we  understand  .he  momentum 
variable  k to  extend  over  the  range  -«»,  +«°;  the  values  of  the  scattering  phase  shift  for  negative  k are  obtained 
through  the  reflection  formula  in  Eq.  (31)  from  the  “experimental”  values,  corresponding  by  convention  to  non- 
negative k (see  Eq.  (4)J  (note  that  for  positive  k the  scattered  wave  is  outgoing,  because  the  time-dependence 
characteristic  of  a stationary  function  consists  of  the  multiplicative  factor  exp(-r'A:2f )). 

We  now  note  that  the  definition  of  G(k2  ,r)  (Eq.  (37)]  and  the  differential  Eq.  (9)  satisfied  by  f(-k,r)  imply 
that  G(k2,  r)  satisfies  the  third-order  equation 

(t"(k2 ,r)  = 4[K(r)  - k2 ] G'(k2,r)+  2V\r)G(k2 ,r)  (38) 


with  boundary  conditions 


G(k2,0)  * 0 

(39) 

G'(k2,  0)  = -1 

(40) 

G '(k2 , 0)  - -2 Reg(k) 

(41) 

We  introduce  now  the  power  expansion  of  G(k2,  r). 


, X~^Gn(k2)rn 

G(k2,r)  = y (42) 

//=  i 

The  convergence  of  '!us  expansion  is  implied  by  the  assumed  holomorphy  of  the  potential  [Verde,  1955] . introduc- 
ing this  expansion,  together  with  that  of  the  potential  [Ef\  (3)] , in  the  differential  Eq.  (38),  we  get  the  recursion 
relations 


Gn+3  (k2)  = -4k2  G^t  (k2)  + 


n+i 


m= i 


Gm(?)  vn+i~m 


(43) 
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while  «he  boundary  conditions  in  Eqs.  (39)  through  (41)  imply 


Gi(k 2)  = -1 

(44) 

Gj(k2)  = -2  Reg(k) 

(45) 

But  from  Eq.  (36)  we  infer  that 


*W*J) 


(46) 


and  inserting  this  relation  in  the  recursion  relations  of  Eq.  (43)  we  obtain 

n 

Sfr+2  _ _4t,  *Jk  +2\^  "+”'  + 2 (n+l\y  ilk 

613 (*)  ~ 6t ik)  / s n + 1 \m+l)Vr™  61 j(k) 

m= 0 


(47) 


Here,  of  course,  the  quantities  Vn  are  considered  functionals  of  the  phase  shift;  recall  that  because  we  are  limiting 
our  consideration  to  the  case  without  bound  states,  to  every  jj(Jt)  there  corresponds  a unique  potential  F(r),  and 
therefore  definite  values  for  all  the  quantities  Vn . 

This  system  of  recursive  functional  differential  equations,  together  with  the  two  starting  conditions 


and 


6K0  _ _4 
rr 


(48) 


implied  by  Eqs.  (46),  (44),  (4 5),  (28),  (22)  and  (23),  allow,  by  sequential  functional  integration,  evaluation  of  the 
quantities  Vn  - namely,  the  values  of  the  potential  and  its  derivatives  at  the  origin,  in  terms  of  the  scattering  phase 
shift  17(h).  The  functional  integration  is  accomplished  through  the  formula  [Vol terra,  19S9] 


- , fs  /“+«>  61^ 

Vn  * Vn  +(*-T)"‘  L dr  / d * [tj(*) -»?(*)]  TT 

Js  6171 


5 Vn 
6t ?’(*) 


(50) 


n'(k)  = (s  - r)  ■'  l (f  - s )rt(k)  - (t  - s)rj(k)] 
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where  Vn  is  the  nth  derivative  (evaluated  at  the  origin)  of  the  potential  V(r)  corresponding  to  the  phase  shift 
r)(J k),  just  as  Vn  is  the  nth  derivative  of  the  potential  V(r)  corresponding  to  the  phase  shift  17 (k). 

A difficulty  is  apparent,  however,  from  the  structure  of  this  equation.  Clearly,  to  be  valid,  the  asymptotic 
vanishing  at  large  k of  the  difference  r)(k)-  rj(k)  must  be  sufficiently  rapid.  Using  the  fact  that 

f(k,r)  = e~ikr[\*0{fTl)\  (51) 

at  large  k we  get  from  Eqs.  (36)  and  (37)  the  estimate,  valid  at  large  k , 

6 V(r)  4 

— = — k cos(2 kr)  (52) 

6tj(J k)  n 


and  from  this  we  get  by  differentiation 
SfW(r) 


«>?(*) 

bv(k) 


= (-)”,J — ln  kfi+l  cos(2fcr)  for  even  n 
ir 


(-)(n+1)  'J  — 2n  *"+l  sin(2Ar)  for  odd  n 


m 


(54) 


where  V^{r)  indicates  of  course  the  nth  derivative  of  V(r).  We  therefore  conclude  that  a sufficient  condition  for 
the  validity  of  Eq.  (SO)  is  that 


m-m  = o(k~n~3) 

at  large  k . Using  the  asymptotic  expansion  in  Eq.  (33)  we  see  that  this  happens  provided 


(55) 


/ = 0,1,...,  n/2 

for  even  n 

(56) 

/ = 0, 1, . . . ,(n  + l)/2 

for  odd  n 

(57) 

We  are  finally  in  the  position  to  begin  the  process  of  functional  integration.  Applying  the  formula  (50)  to 
Eq.  (48)  we  get 

— 4 f +°°  _ 

V0  = K0+—  / d**{rK*)-T?  (*)1  provided  a0  * a0  (58) 

n 


or,  equivak.itly, 
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J“  dk  k |q(k)  - = K0 J d*  provided  a0  = 


a o 


(59) 


the  constants  a0 , a0  being  defined  by  Eq.  (33)  and  by  a similar  formula  for  r j(it)  in  which  the  constants  an 
appear.  This  equation  may  also  be  written  in  the  form 


4 I*"1"00  f T)(k)  - a0~\ 

” JL  T~ Ho(flo) 


(60) 


where  the  function  A0  is  a universal  function  of  its  argument.  This  function  may  be  easily  evaluated  using  a simple 
potential  whose  phase  shift  is  exactly  known,  and  it  turns  out  to  vanish  identically  (when  no  bound  states  are 
present).  Therefore,  we  can  write 


(61) 


(62) 


The  equivalence  of  Eq.  (62)  to  Eq.  (61)  is  easily  checked  by  partiaLintegration  (see  Appendix  A). 

Equations  (61)  and  (62)  provide  an  explicit  expression  for  the  value  of  the  potential  at  the  origin.  They  were 
first  obtained  by  Newton  [1956] , using  the  same  procedure  as  here,  and  by  Faddeev  [1957] . Both  authors  obtained 
a more  general  version;  for  instance,  Newton’s  result  can  be  written: 

Vo  « ~ (28  + I)-1  I d*  f-*  -i-J  [*ng(*)]  + 4(28  + 1) _1  ^ % (63) 

" Jo  \ * n 


where  the  sum  is  extended  over  the  energies  of  all  the  bound  states  with  given  8.  This  formula  is  more  general  than 
Eqs.  (61)  and  (62)  in  two  respects:  it  includes  the  possibility  that  bound  states  are  present,  and  it  applies  to  all  values 
of  8,  not  only  the  5-wave  case  (8  = 0). 

Continuing  the  process  of  sequential  integration,  and  applying  the  rule  (50)  to  Eq.  (49),  one  obtains 

Vt  - Vi~^  /_>,  [I/WI-*  + 2A(k) - I /(*)!-*  - 2A(*)]  provided a0  =a0,a  = dt  (64) 


where  A(k)  and  I /(A)  I are  given  in  terms  of  tj (*)  by  Eq.  (22)  and  (23),  and  A(Jt),  l/(Jk)l  are  given  by  similar 
equations  in  terms  of  rj(k).  Again  one  can  conclude  from  this  formula  that 


Vt 


r 


d**2  [i/wr2 


+ 2A(*)-1]  = A ,(«„,«,) 


35) 
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where  A\ (o0>ai ) is  a universal  function  of  its  arguments.  This  function  coulri  be  found  using  a simpl-  solvable 
potential  (containing  at  least  two  parameters),  but  we  defer  to  a later  section  a derivation  of  the  final  form  of  this 
equation.  Note  that  the  integral  in  Eq.  (65)  is  convergent,  because  the  integrand  vanishes  asymptotically  as  k~2  ; 
this  is  implied  by  Eq.  (22)  and  (34). 

The  process  of  sequential  integration  can  be  continued,  using  the  recursion  relations  (47)  and  the  explicit 
expressions  of  the  Vn  that  are  successively  determined.  For  instance,  the  next  functional  equation  to  be  integrated 
would  be 

5 16  ( 3 4 d | 

— ' = S*  +~  / dflo  ~ fah(<?)]>  (66) 

5v(k)  it  | it  J ^ J q i 

Clearly  the  formula  (50)  is  always  adequate  to  perform  the  integration,  although  the  labor  involved  gits  progressively 
m^re  difficult. 


THE  METHOD  OF  OPERATOR  TRACES 


A complete  treatment  of  the  method  of  operator  traces  has  been  given  by  Buslaev  [1966]  in  the  context  of 
three-dimensional  scattering  theory.  Here  we  try  to  convey  the  main  idea  of  this  method  in  the  most  elementary 
setting,  following  the  work  of  Percival  [1962]  and  Buslaev  and  Faddeev  [1960] . In  terms  of  the  problem  under  con- 
sideration, the  main  limitation  of  this  approach  is  its  inability  to  yield  an  expression  for  the  first  derivative  of  the 
potential  at  the  origin,  and  in  general  to  deal  with  potentials  that  are  not  even  (so  that  their  derivatives  of  odd  crder 
do  not  all  vanish  at  the  origin).  On  the  other  hand  the  great  mathematical  generality  of  the  approach  has  a great 
potential  for  applications  in  other  fields  of  mathematical  physics.  Dikii  [1958]  has  reviewed  the  trace  method  in  the 
general  context  of  the  theory  of  Sturm-Liuville  differential  operators. 

The  basic  idea  of  this  approach  is  best  introduced  considering  the  hamiitonian  H of  a quantum-mechanical 
system  that  admits  only  a finite  number  .V  of  linearly  independent  states.  Such  a hamiitonian  may  be  represented 
by  an  NXN  matrix,  and  the  invariance  of  the  trace  of  a matrix  under  unitary  transformations  implies  the  weil-kr.own 
formula 


i 


N 

n\HP\  - £ £/ 

n=l 


where  the  En' s are  the  eigenvalues  of  H and  p is  an  arbitrary  (integral)  exponent. 
Our  problem  is  characterized  by  the  hamiitonian 


H = T+  V 


dr2 
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(67) 


(68) 


+ V(r) 


(69) 


whose  eigenvalues  are  all  the  positive  values  E = k1,  and  possibly  a finite  number  jf  negative  values  En , correspond- 
ing to  (S-wave)  bound  states,  as  well.  Here  we  keep  open  the  possibility  that  bound  states  exist,  since  their  presence 
does  not  cause  any  additional  complication  - on  the  contrary,  it  may  even  make  the  treatment  more  transparent. 

There  is  now  a continuous  infinity  of  eigenvalues,  so  that  the  formula  equivalent  to  Eq.  (67)  reads 


n[HP]  = (En)P  + f°°AEEPn(E) 


(70) 


where  n(E)  is  the  density  of  states  ir  the  continuum.  This  formula,  however,  has  only  a symbolic  meaning,  because 
the  integral  in  the  right-hand  side  generally  does  not  converge  (nor,  of  course,  does  the  trace  in  the  left-hand  side). 

On  the  other  hand,  because  the  density  of  states  n(E)  is  simply  related  to  the  scattering  phase  shift  rj{k)  (see  below), 
if  it  were  possible  to  assign  a definite  meaning  to  this  formula,  one  would  obtain  relationships  between  the  energy 
moments  of  the  scattering  phase  shift  on  the  right-hand  side,  and  the  potential  that  is  contained  in  the  hamiltonian 
operator  on  th*  left-hand  side.  Thk  motivates  the  attempt  to  extract  definite  relationships  from  the  symbolic  Eq.  (70). 

TK-  first  trick  to  use  is  to  subtract  the  same  expression  for  the  case  without  interaction  from  Eq.  (70),  obtaining 


tr[HP-TP]  = £ (En)p+  J "df  £?«/£)  (71) 

n 0 

where  nfJE)  is  now  the  increment  in  the  density  of  states  due  to  the  interaction.  It  is  well  known  (see  Appendix  B) 
that 

*/£)  = (2nk)~l  jr  r l{k)  (72) 


where  r\(k)  is  the  scattering  phase  shift.  Thus,  in  place  of  Eq.  (71)  we  may  write 

tr  [Hp  -TP]  = V*  ( En)P  + — * f°°dk  klp  -jj-  V{k)  (73) 

Lmmd  n r 0 dk 

n 

Of  course  this  equation  has  still  only  a symbolic  meaning,  because  in  general  the  integral  in  the  rig,  . land  side  does 
not  converge.  For  p = 0,  however,  the  equation  is  directly  meaningful,  and  it  implies  the  result 


t|(0)-i?(«»)  - Nit 


(74) 


where  N is  the  number  of  (5-wave)  bound  states.  This  is  just  Levinson’s  theorem  [Levinson,  1949). 

We  now  outline  the  two  approaches  of  Percival  [1962]  and  Buslaev  and  Faddeev  [1960]  to  extract  meaningful 
relationships  from  the  symbolic  Eq.  (73). 
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Perival  Approach 


In  place  of  Eq.  (73)  one  substitutes  the  equation 

-eh*  (75) 

ak 


which  is  now  certainly  meaningful  for  positive  0.  This  approach  is  very  close  to  that  employed  in  certain  problems 
of  statistical  mechanics;  indeed,  this  equation  can  be  found  in  an  old  paper  by  Uhlenbeck  and  Beth  [1936] , where  it 
is  used  to  study  the  virial  expansion  for  a nonideal  gas.  It  is  then  noted  that,  while  an  expansion  or  Eq.  (75)  in  powers 
of  0 is  not  allowed,  as  it  would  merely  reproduce  Eq.  (73),  one  can  expand  in  powers  of  (3 1/5  and  equate  the  coef- 
ficients of  equal  powers  of  01'2 . (This  corresponds,  in  the  context  of  statistical  mechanics,  to  a rearrangement  of 
the  Kirkwood-Wigner  expansion  of  the  partition  function  [Landau  and  Li f shits,  1959] ).  In  this  manner  we  obtain 
the  moment  relationships 


ij" 


dice 


— f dr  f dkexp(/lcr)[(7'+  V)P  - T?]  exp(ifcr) 

4;r  — oo  oo 


_ n-° 


for  p=  1,2,3,. . . (76) 


The  left-hand  side  originates  from  the  trace,  operation;  the  right-hand  side  contains  the  coefficients  an  characterizing 
the  asymptotic  behavior  of  tj(Ac)  through  Eq-  (33).  Using  the  results  of  Appendices  A and  C,  Eq.  (76)  can  be  rewritten 
in  the  more  compact  form 


K(r)--(d- 

4 


d) 


} -V  l£ny>X*  1/ 

I , = 0 " [_ti(2p-l)!j  ° 


dkD2P~l  T^k)  for p=  1,2,3,... 

(77) 


where  the  operators  d , d and  D are  defined  by 


fir)  d g(r)  h f(r)g'(r)J(r)  d g(r)  = f\r ) g (r),  and 


D 


k 


(see  Appendices  A and  C). 

To  obtain  Eq.  (76),  and  therefore  Eq.  (77),  however,  it  is  necessary  to  assume  that  the  potential  function  V(r) 
is  not  only  holomorphic,  but  also  even 

d ln+l  V{r) 

S ~^nTi~  = R for  » 58  0.1 .2,  • • • (78) 

ar  r- 0 
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The  values  of  V(r)  for  negative  r occurring  in  the  integral  in  the  left-hand  side  of  Eq.  (76)  are  then  defined  by 


\ 

V', 


V(-r)  = K(r) 


(79) 


“V 

A 

V 


This  prescription,  together  with  the  asymptotic  vanishing  of  V(r),  ensures  convergence  of  the  integration  over  the 
variable  r. 

The  treatment  of  Appendix  C implies  that  Eq.  (77)  can  be  rewritten  in  the  form 


(Enf  + (~fH 


2tp+3  f 

*(?P  + 1)!  J0 


AkD2P+l  m + Fp  ( V0 , V2, . . , Vtp-i) 


(80) 


where  Fp  is  defined  by  the  equation 


Ep  (Po.  Fj>  ■ • • , V2p-i)  = (-JP+1 


(81) 


and  depends  only  on  the  Vn  whose  order  n does  not  exceed  2p-2.  It  is  therefore  clear  that  from  Eq.  (80)  it  is 
in  principle  possible  tc  obtain  sequentially  all  the  even  derivatives  of  the  potential  in  terms  of  the  generalized  moments 

P-1.2.3,...  (82) 

The  derivative  Vip  of  order  2p  depends  only  on  the  moments  whose  order  n does  not  exceed  p + 1.  Of  course, 
t*»e  knowledge  of  all  the  even  derivatives  is  sufficient  to  reconstruct  the  whole  potential  since  it  is  assumed  to  be  even. 
On  the  ''♦her  hand,  the  cc'.^tio.v-  that  the  phase  shift  and  the  bound-state  parameters  must  satisfy  so  that  th 
corresponding  potential  le  fen  are  far  from  trivial;  moreover,  their  investigation  is  beyond  the  power  of  the  approach 
.ve  have  just  described. 

Explicit  expression.-  of  the  first  derivatives  of  the  potential,  evaluated  at  the  origin,  in  terms  of  the  phase  shift 
and  the  bound-state  data,  are  given  in  a later  section  (for  the  general  case  of  a potential  that  need  not  be  even); 
therefore  we  do  ot  report  here  the  results  that  obtain  from  Eq.  (80). 


Buslaev  and  Faddeev  Approach 

The  approach  of  Buslaev  and  Faddeev  [1960] , although  originating  from  the  method  of  operator  traces,  is,  in 
fact,  closer  to  the  approach  based  on  -the  as;-  mptotic  expansion  of  the  scattering  parameters,  and  on  their  analyticity 
properties  (see  the  next  section).  This  method  is  also  applicable  to  potentials  that  are  not  even,  but  it  can  yield 
explicit  expressions  for  all  the  derivatives  of  the  potential  at  the  origin  only  in  the  case  of  even  potentials-that  is, 
only  if  all  die  derivatives  of  odd  order  are  a priori  known  to  vanish  at  the  origin. 
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The  basic  equation  of  the  Buslaev  and  Faddeev  [1960]  method  is 


£ (-£•*)*  = 0 
n 


that  holds  in  the  strip  of  the  complex  z plane  characterized  by  the  inequality 

1 

0 < Re  z<  — 

2 


sin(rrz)  f d k k2z  1 rj(fc)  + cos  (trz)  f dk  klz  ‘lnl/(fc)l  + 

•vi 


2z 


(83) 


(84) 


This  equation  is  established  by  contour  integration  of  the  function  fcJZ(d/dit)  lr./(fc),  exploiting  t’  e holomorphy  of 
f(k)  in  the  lower  half  plane  dm  k < 0,  and  the  fact  that  its  zeros  there  correspond  to  the  bouRd  s.a  es  as  indicated 
earlier. 

The  similarity  of  this  equation  to  Eq.  (73)  should  be  noted,  as  also  the  disappearance  of  any  explicit  dependence 
on  the  interaction.  Of  course  the  restriction  to  the  strip  0 <Re  z < 1/2  is  essential.  The  moment  relationships  are 
obtained  by  analytic  continuation  outside  this  strip,  to  the  vain*®  :-p  + 1/2  and  z~p,  with  p = 1, 2,  3, . . . 

This  continuation  leads  to  formulas  that,  using  the  results  of  Appendix  A may  be  cast  into  the  form 


ap  “ (“)p 


f°°dkDiP  In  \ftk)\ 
Jo 


dp  = 


7t(2 p - 1)!  Jo 


X 


dk  D2P~l  t?(*) 


(85) 


(86) 


where  ap  and  dp  are  the  coefficients  of  the  'asymptotic  expansion  of  r?(fc)  and  A(fc),  Eqs.  (33)  and  (34).  Using 
Eq.  (22),  or  rather  its  generalized  version  when  bound  states  are  present,  which  reads 

l/(fc)l  = [ri(i  -fc_5£w)J  exp  [A(^)]  (87) 

with  A (k)  always  defined  by  Eq.  (23),  it  is  possible  to  recast  Eq.  (85)  into  the  form 


aP 


2 

rr(2p)! 


f°°dkD2P  A (*) 
** o 


(88) 


Actually  it  is  possible  to  derive  these  explicit  expressions  for  the  asymptotic  coefficients  an  and  dn  by  more 
straightforward,  if  perhaps  less  rigorous,  techniques  [Roberts,  1964] . 

The  results  embodied  >n  Eqs.  (85)  and  (87)  do  not  provide  an  explicit  connection  between  the  energy  moments 
of  the  phase  shifts  [and  the  logarithm  A(k)  of  the  modulus  of  the  Jost  function] , the  bound-state  energies,  and  the 
interaction.  Such  a connection  is  introduced  using  the  explicit  asymptotic  expressions  of  the  phase  shift  and  the 
modulus  of  the  Jost  function,  or  of  its  logarithm,  in  terms  of  the  potential.  Such  expressions  are  derived  and  discussed 
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in  the  following  section,  using  a technique  due  to  Verde  [1955].  Similar  results  have  been  obtained  with  a 
different  procedure  by  Buslaev  and  Faddeev  [1960],  which  state  that 


ap  = (“)p  2-^'  Qip+l  (89) 

dp  = p“  ^(£„)p  + (-)P+1  2~7P  Qlp  (90) 

n 


where  the  coefficients  Qn  are  obtained  from  the  recursion  relations 


Qp  = Vp- 1 + ^2  2/ 

/='  7 


(91) 


where,  by  definition, 


Kj  = .Urn  [Kk<r)]  (92) 

and  the  functions  V^r)  are  obtained  by  recursion  from 

F0(r)  = - f\v(s)  (93) 


and 


2-1  -r 

fy')  - ^>(0)+£  (*’')  J dsKm.(s)  Kg 


!-» -m) 
is) 


(94) 


m-0 


As  usual,  F^Vr)  indicates  the  nth  derivative  of  the  potential,  so  that  K^”\o)  = V„  are  the  quantities  we  are 
trying  to  compute.  Note  that,  for  p = 1,  Eq.  (91)  yields 


G.  = K 


(95) 


The  recursion  relations  (91)  are  solved  by  the  formula  [Calogero  and  Degasperis,  1968] 


(96) 
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where  the  sum  extends  over  ail  positive  integral  values  of  £ and  jg  and  all  nonnegative  integral  values  of  Sj 
(/  = 1,  2,  1)  such  that 


£ 

sj  = p (97) 

i*  l 

Explicit  expressions  of  the  first  four  coefficients  Qp  are 


Q\  = 

- /”d rV(r) 

(98) 

Qs  = 

V(Q) 

(99) 

Qs  * 

F'(0)+  /°°dr^(r) 
0 

(100) 

Q 4 - 

V"(0)-2V3(0) 

001) 

inserting  these  expressions  of  Q2  and  Q4  in  Eqs.  (90)  and  (86)  Buslaev  and  Faddeev  [1960]  obtain  explicit  expres- 
sions for  V0  = V(0)  and  V2=  K''(0)  in  terms  of  the  generalized  energy  moments  of  the  scattering  phase  shift  given 
in  Eq.  (82).  It  is  however  clear  that  this  approach  cannot  produce  an  explicit  expresssion  for  the  first  derivative 
Vx  - V'{0)  of  th.'  potential  at  the  origin,  nor  for  the  higher  derivatives,  except  in  the  case  of  an  even  potential;  for 
instance,  it  is  easily  seen  that  Qt  is  a linear  combination  of  VA.V2  Vq,(V,  )2 , and  (K0)3. 


THE  METHOD  OF  ASYMPTOTIC  EXPANSIONS 


This  method  is  based  on  the  relationship  between  the  asymptotic  expansion  of  the  scattering  parameters,  and 
the  interaction. 

We  begin  with  an  analysis,  following  Verde  [1955] , of  the  asymptotic  expansion  at  targe  k of  the  auxiliary 
function 


g(*.  r)  » ik  + 


fXKr) 

/(M 


(102) 


The  radial  Schrodinger  equation  satisfied  by  f{k,  r)  implies  for  g(k,  r)  the  Riccati  equation 


g(k,r)  - V{r)  + Ukt{k,r)-e{k,n 


(103) 


5o0 


v.hile  the  condition  characterizing  the  Jost  solution  f(k,  r),  Eq.  (19),  implies  that  g(k,r)  vanishes  at  large  r : 

lim  (g(fc,  r)]  =0  (104) 

f OO 


On  the  other  hand,  at  the  origin 


f (*.  0)  = g(k) 


(105) 


where  the  function  g(k)  is  the  same  one  introduced  in  Eq.  (24). 
Introducing  the  asymptotic  expansion  at  large  k 


N 


g(k.r)  = > gn(r)(-2ffc)-"-‘ + 0(k-v-J) 


(106) 


into  the  Riccati  Eq.  (103),  we  obtain 


n-1 

g'n(0  + fn+iW  + gm(r)gn-t  -m(r)  * 0 - n > 1 
m=0 


(107) 


These  recursive  relations,  together  with  the  starting  conditions 


= V(r) 

(108) 

gl(r)  = -V'(r) 

(109) 

and  the  asymptotic  property 

®iH  * o 

(HO) 

are  sufficient  to  determine  all  the  functions  gn(r)  in  terms  of  the  potential, 
algebraic  expression  involving  V^  (r)  for  p up  to  n ; for  instance, 

Moreover,  it  is  clear  that  gn(r)  is  an 

*s(r)  - V'\r)-V\r) 

(111) 

g,(r)  - -V"\r)~  4 V(r)  V\r) 

(112) 
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and  so  on.  It  is  clear  from  the  structure  of  these  equations  that  they  can  be  inverted,  so  as  to  obtain  V^P\r)  as 
an  algebraic  expression  involving  the  quantities  gn(r)  for  n up  to  p ; and  it  is  easily  seen  that  the  ,.-neral  structure 
of  the  resulting  expression  is 


V^Xr)  = - n h/f)  013) 

where  the  sum  extends  over  all  positive  integers  £ and  over  all  sets  of  nonnegative  integers  Sj  (a  set  being  by  defini- 
tion independent  from  the  order  in  which  the  indices  Sj  appear),  and  the  coefficients  j p,  £ ls1 , s2 , . . . . Sg|  vanish 
unless  the  equality 

£ 

£ */«  p-28  + 2 (114) 

M 

is  satisfied.  This  “dimensional”  condition  severely  restricts  the  number  of  terms  in  the  sum.  For  instance  the 
allowed  values  of  £ are  restricted  by 


p 

£ < 1 + — 
2 


0 


The  coefficients  j p,  £ I j, , s2 Sj  | are  discussed  in  detail  by  Calogero  and  Degasperis  [1963] ; a table  of  all 

their  values  for  p up  to  10  is  also  given. 

For  r * 0 Eq.  (1 13)  becomes 


vp  * I!  *Sj  (116) 

where  the  quantities 

gs  *fc(0)  (117) 

are  the  coefficients  of  the  asymptotic  expansion  of  g(k): 


N 

g(k)  - £ ^(-2/*)-"-'  *0(k~^)  (118) 

«■ o 

To  realize  the  task  stated  in  the  title  of  this  paper,  it  is  now  sufficient  to  express  the  quantities  gn  in  terms  of 
the  scattering  phase  shift.  This  can  be  easily  done  starting  from  the  explicit  expression  of  g(k)  in  terms  of  the  phase 
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shifts  [Eqs.  (25),  (28),  (22),  and  (23)]  and  exploiting  thi  analyticity  properties  of  g{k),  according  to  a technique 
first  suggested,  in  a similar  context,  by  Roberts  [1964b] . We  report  here  the  results,  relegating  their  proof  to 
Appendix  D.  They  consist  of  the  formulas 


/•OO 

2jp+3  r 

019) 


foo 

d kD2P+3  [fc~n./(fe)l“a  ] 


gsp  = 2JP+' 


E n 


(-2d/)V 

*/! 


(120) 


(121) 


the  sum  in  the  last  equation  being  extended  over  all  positive  integral  values  of  $ and  Sg  and  nonnegative  integral 
values  of  sy,  0 = 1,  2, ....  8 - 1),  such  that 


jSj  = P+  1 


022) 


Note  that  this  last  condition  restricts  severely  the  range  of  aliow*d  values  of  £ and  of  the  indices  sy;  for  instance,  it 
implies 


1 < t < 1 ♦ p (123) 

The  quantities  dy  in  Eq.  (121)  are  proportional  to  the  moments  of  the  scattering  phase  shift,  being  defined  by 

d , | dkD'P-'»(k)  (124) 

tr(2p-l)!  J0 

and  they  coincide  with  the  coefficients  of  the  asymptotic  expansion  of  A(*)  (Eq.  (34)] . The  operator  D entering  in 
this  equation  and  in  Eqs.  (119-1 20)  is  the  same  one  discussed  in  Appendix  A,  namely, 

D = * 4-*  (125) 


Note  that  Eq.  (1 20)  obtains  from  (1 19)  performing  one  integration  by  parts  (see  also  Appendix  A). 


We  thus  see  that  the  parameters  gn  of  the  asymptotic  expansion  (1 18)  of  g{k)  can  be  expressed  explicitly  in 
te.ms  of  the  scattering  phase  shift;  the  parameters  g„  with  n even  are  algebraic  combinations  of  the  energy  moments 
(124)  of  the  scattering  phase  shift,  while  the  parameters  gn  with  n odd  are  directly  proportional  to  the  generalized 
moments  of  the  spectral  function  1 - I /(Jt)l-1  [and  this  function  can  be  obtained  from  the  scattering  phase  shift 
through  the  explicit  formulas  (22)  and  (23)] . Inserting  these  expressions  in  Eq.  (1 16),  we  obtain  the  final  expression 

P+1 

vp  = ^ -.*g+.]  I~[  ( Jj)*i  (126) 

where 

21/+3  reo 

hi*  / d kD'r-'v(k)  (127) 

• »(2/-l)!  *o 

= (-)/  22f*'  d j (128) 

and 

hj* t = (')'  ~ — ,77-  / d*Da/+*  [jr«l/(k)l^]  (129) 

' *(2/  * 1)!  *o 


- gij-i 


(130) 


The  sum  in  Eq.  (126)  extends  over  all  positive  integral  values  of  £ and  s^,  and  all  nonnegative  integral  values  of 
Sj  0 = 2,3 £),  such  that 


jSj  = p + 2 (131) 


Thus  the  maximum  value  of  £ appearing  is  (126)  is  p+1.  The  coefficients  [p,  £lsj..ss, . . . ,Sj+j]  are  related  in 
an  obvious  way,  through  Eqs.  (1 19-121)  and  (127-130),  to  the  coefficients  {p,  8 I s i , s2 , . . . , Sj}  of  Eq.  (116). 
They  have  been  computed  by  CMogero  and  Degasperu  [1968]  for  all  values  of  p up  to  10,  and  are  given  in  Table  I 
of  their  paper.  Using  these  coefficients,  it  is  possible  to  v rite  the  values  at  the  origin  of  ' • potential  and  its  first  10 
derivatives,  in  terms  of  the  phase  shift  The*'  explicit  ex ' dons  are  given  in  the  following  lection,  which  contains 
a general  summary  of  the  results  for  the  case  of  S-wavr  tv  • jring  on  a holomorphic  potential,  including  the  possi 
bility  that  bo’ind  states  are  present 

Finally,  we  note  that  while  the  general  philosophy  of  the  approach  given  in  this  section  was  introduced  by 
Robert*  [19646] , his  method  could  not  yield  all  the  results  reported  here,  because  it  focused  on  the  asymptotic 
expansion  of  the  function 
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( i 32) 


rather  than  on  the  function  g(k,  r).  As  explained  in  detail  by  C<dogero  and  Degasperis  [1968]  (see,  in  particular, 
their  Appendix  H),  the  information  derivable  from  a study  of  the  asymptotic  expansion  of  h(k)  corresponds  as  far 
as  the  values  of  the  potential  and  its  derivatives  ai  the  origin  are  concerned,  only  to  that  derivable  from  an  exuar.a;on 
of  the  odd  part  of  g(k),  namely  that  transmitted  by  the  coefficients  gn  with  n even  [see  Eq.  (118)) . Thus  Roberts 
[1946b]  could  not  obtain  an  expression  f ir  F'(0)  = V1%  nor  could  he  obtain  expressions  for  Vp  for  p>  2,  except 
in  the  special  case  of  an  even  potential- that  is,  a potential  such  that  all  Vp  with  odd  p vanish. 


S-WAVE  SCATTERING  ON  A HOLOMORPH1C  POTENTIAL:  SUMMARY  OF  RESULTS 


Let  Tj(it)  be  the  S-wave  scattering  phase  shift  produced  by  the  potential  K(r),  En  the  (negative)  energies  of 
the  corresponding  (S-wave)  bound  states,  and  Cn  the  normalization  coefficients  of  these  bound  states  [defined  by 
Eqs. ( 1 S) and (16), with  8 = 0]. 

Let  Vp  be  the  value  at  the  origin  of  the  pth  derivative  of  the  potential  (assumed  to  be  holomorphic  at  r = 0 
and  to  satisfy  the  usual  requirements  in  scattering  theory,  detailed  earlier): 


p=  0 


(133) 


Then  the  following  formula  holds: 


ES+l 

.jg+all  Vj)s’ 

/= 2 


(134) 


the  sum  being  extended  over  all  positive  integral  values  of  8 and  Sg+1  and  nonnegative  integral  values  of  Sj  that 
are  consistent  with  the  dimensional  condition 


8+1 

£ ;s;  = p + 2 (i:j) 

/= 2 


The  universal  coefficients  [p,  8 lij,  j3,  . . . , Sj+I]  depend  only  on  the  indices  shown;  procedures  to  compute  them 
together  with  a table  'r  all  the  nonvanishing  coefficients  for  p up  to  10  [see  Eqs.  (145)  through  ( 1 55)]  are  given 
by  Calogero  and  Degasperis  [1968] . The  integrals  Jn  with  even  n are  essentially  the  generalized  momentum 
moment  'of  odd  order  n - 1)  of  tlte  scattering  phase  shift;  they  depend  on  the  phase  shift  n(k)  and  on  the  bound 
state  energies  En,  but  not  on  the  normalization  coefficients  Cn : 
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. (-,/*.  */»{}  £ tui  ♦ !*>/-'  ,w 


- (-)/+'  2^jiV  (£«)/-  — f d**»HLft)-2  a/1*"2""' 
( I -V  ff  •'o  L 


The  coefficients  an  entering  in  Eq.  (137)  are  those  characterizing  the  asymptotic  behavior  of  the  phase  shift 


iT 

= ^ a n k~2n~l  +0(k'2N'3  ) 


They  can  be  also  defined  by  the  integral  expressions 


d2n+1 


The  integrals  Jn  with  odd  n are  essentially  the  generalized  momentum  moments  (or  order  n - 1)  of  the  spectral 
function  1 - ! /(ft)!-2  ; they  depend  not  only  on  the  phase  shift  and  the  bound-state  energies,  but  also  on  die  bound- 
state  normalization  coefficients  CR>  defined  by  Eq.  (IS): 


n 

= (-)/“  2 »/  |y^Cn(£„)rl  J dkk*. 


d2/*1  ) 

ik^1  l*"7  W*]} 


(-)^‘  2*/<  > Cn(En)i-' I d**2>  1 - IA*)I“*  * > (-yfJR^"i>  (141) 


The  coefficients  - (-)"  £Jf,  entering  in  Eq.  (141)  are  those  characterizing  the  asymptotic  behavior  of  the  spectral 
function 


i - i/(fc)i~2  - - V (-)”£„  + 0(k^"^) 


The  spectral  function  is  expressed  in  terms  of  the  phase  shift  and  the  bound-etate  energies  through  the  formula 
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(143) 


jg*  V • 


where 


!/(*)!  - «p  [A(Ar>] 


Q 

(q*-k>) 


v(q) 


(144) 


Below  we  display  the  explicit  expressions  for  the  values  at  the  origin  of  the  first  10  derivatives  of  the  potential,  implied 
by  Eq.  ( 1 34)  and  the  values  of  the  coefficients  (p,  £ I s, , s3 , . . . , sg+  ,1  given  by  Calogero  and  Degasperis  [ 1 968] : 


*o=  -“/a  (MS) 

4 

V,  = ~J3  (146) 

k,=  -r(y’>a~"rJ«  047) 

8 4 

v3  = 7,73  - js  (148) 

v<=  -7(/3)3+5(y3)1+W^4-^-i6  (149) 

o 2 4 

v*  = - ~ (/»>*/,  + -^  73  74  + 27,  J;  -7,  (150) 

4 2 

Ks  = ~(7,)4  - 2872(73)2  + -f(74)a  _2(7j)V4  + 287375  --7  h (151) 

2 4 4 4 

K7  = -140(73 )J  + (73)37j  - 377,7374  + 17747s  -~-(7,)a7,  + 107374  + 37,7,  -7,  (152) 


- " — (^)S  + 258(7j)1(73)1  - 147j(74)1  + y(7,)3  + ' 246  J*  J*  J* 

+ 7747 4 - 5(7,)2  7,  + 547,7,  +7,7,  - — 7,0  (153) 

4 
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K,= 


*'io= 


6047  791  1667  523 

2310 72(7,)3  - — — - — /2(74)J  + — — (72)57,74  - 20872747,  - 23io(7J)J/,  + — 

48  4 4 0 

125  83  107  35 

+ — - /s76  - 1 157,73/4  + 7477  - (7j)  77 +'  /s/g  + 4/j/»  -J,, 

2 2 4 2 

59  * „ 9,96  , , 164  , . , 

— (A)6  + 11550(73)“  + — (7j)3(73)j ~ (/4)3  + + S852/2</3)J/4 

63 

-836/2(/,)J  -36527,747s  + 3344(7,  )j/j/5  + — (76)J  - 1087}7476  - ll00</3)276  + 46(/2)37t 

4 

, 5 1 

+ 4187,7,  -628/2/,/7  + 15747*  - 10(7,)J/»  + 88/,/,  + — 727l0 /I2 


(72 )'/, 
(1S4) 


(155) 


In  the  very  special  case  of  an  even  potential  - a potential  such  that  Vn  = t)  for  n odd  all  the  integrals  Jn 
with  n odd  must  vanish;  this  implies  an  (infinite)  number  of  constraints  that  must  be  satisfied  by  the  phase  shift,  the 
bound-state  energies,  and  the  normalization  coefficients  [Eqs.  (140)  and  (141)] . Thus  the  binding  energies  and 
normalization  coefficients  of  an  even  potential  are  determined  by  the  scattering  phase  shift,  which  also  must  satisfy 
an  infinite  number  of  integral  conditions.  For  instance,  if  only  one  bound  rtate  is  present,  and  the  potential  is  even, 
the  normalization  coefficient  C of  the  bound  state  is  given  by  the  explicit  equation 


k*(k*  -£)“*  exp  l-2A(*)]J  (1561 


while  the  energy  £ of  the  bound  state  can  be  obtained  from  the  equation 


£ 


ds 

d**s—  ks  (k1  - £)  exp[-2A(*)] 

d3  ) 

k6(k2  -E)-3  exp[-2A(*)]  j 


(157) 


Here  A(Ar)  is  given  in  terms  of  rj(Jk)  by  Eq.  (144).  Of  course,  for  an  even  potential  the  (even)  derivatives  of  the 
potential  at  the  origin  are  expressed  as  polynomials  of  the  integrals  Jn  with  n even  [Eqs.  ( 1 36)  and  (137)]  - that 
is,  only  in  terms  of  the  generalized  momentum  moments  of  the  piiasv  shift. 

The  possibility  of  expressing  all  the  derivatives  of  the  potential  at  the  origin  in  the  form  (134)  has  some  impli- 
cations for  the  so  called  “Jost  equivalent  potentials”  (which  have  the  same  phase  shift  and  bound-state  energies 
and  therefore  the  same  lost  function  f(k)  = l/(fr)l  exp  [/»?(*)]  [see  Eqs.  (143),  (144)] , but  possibly  yielding 
different  normalization  coefficients  Cn  ) and  for  the  “Bargmann  potentials"  (whose  Jost  function  is  a polynomial 
in  *r).  Fo'  these,  and  a brief  discussion  of  the  possibility  of  finding  other  explicit  relationships  between  the  scatter- 
ing data  d the  potential,  see  Catogero  and  Degasperis  [1968] ; see  also  the  last  section  of  this  paper. 
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RESULTS  FOR  HIGHER  PARTIAL  WAVES 


Some  of  the  approaches  described  in  the  previous  sections  extend  trivially  to  the  case  of  higher  partial  waves. 
Indeed  the  result  that  is  at  the  origin  of  this  line  of  research,  namely  the  explicit  expression  for  the  value  V0  of  the 
potential  at  the  origin  in  terms  of  the  scattering  phase  shift,  was  originally  given  by  Newton  [1956]  and,  less 
explicitly,  by  Faddeev  [1957]  for  an  arbitrary  partial  wave: 

4 V'  8 f°°  d 

V0  / Et„ I dkk [*t>o(*)1  (158) 

28+1  L-J  »«  *(28  + 1)  J0  d*  8 

where  T?g(lfc)  is  the  8-wave  scattering  phase  shift  and  are  the  energies  of  the  bound  states  (if  any)  with  angular 
momentum  8.  The  convergence  of  the  integral  in  the  right-hand  side  is  guaranteed  by  the  asymptotic  behavior  of 
the  phase  shift 


ng(*) a fok  1 +0(k  3 In *) 


(159) 


which  implies 


— [Ai7gC*)l  7— ' 0(k  In  k) 
dk  * k-K™ 


(160) 


As  we  discuss  below,  this  asymptotic  behavior  is  related  to  the  finiteness  of  the  value  of  the  potential  at  the  origin. 
The  trace  method  discussed  earlier  has  been  extended  to  higher  partial  waves  by  Percival  and  Roberts  [ 1 963] , follow- 
ing the  route  introduced  for  S- waves  by  Percival  [1962] , and  by  Buslaev  [1962, 1966] , who  based  his  approach  on 
the  method  introduced  for  S waves  by  Buslaev  and  Faddeev  [1960]  but  was  actually  able  to  treat  the  scattering 
problem  directly  in  three-dimensional  space,  without  performing  a partial  wave  expansion.  However,  as  noted,  ihe 
trace  method  is  unable  to  yield  the  values  of  all  the  derivatives  of  the  potential  at  the  origin  in  terms  of  the  scatter- 
ing and  bound-state  data,  except  in  the  very  special  case  of  an  even  potential  - that  is,  only  if  it  is  a priori  known 
that  all  the  derivatives  of  odd  order  of  the  potential  vanish  at  the  origin. 

The  main  difficulty  in  extending  the  method  of  asymptotic  expansions,  which  in  the  5-wave  case  provided  the 
most  complete  solution  of  our  problem,  to  the  case  of  higher  partial  waves  and  also  to  the  case  of  nonholomorphic 
potentials  is  due  to  the  fact  that  the  asymptotic  expansion  of  the  scattering  phase  shift  at  large  k contains  generally 
logarithmic  terms  in  addition  .o  powers  if  the  restriction  to  S waves  (and  to  holomorphic  potentials)  is  abandoned. 
There  is,  however,  a subclass  of  holomorphic  potentials  such  that  the  corresponding  scattering  phase  shifts  i\tfk),  for 
all  8 < L,  still  contain  only  (inverse)  powers  of  k in  their  asymptotic  expansions.  The  condition  characterizing  this 
class  of  potentials  is  the  requirement  that  the  values  at  the  origin  of  the  first  L derivatives  of  odd  order  of  the  poten- 
tial vanish; 


Kan+I  = 0 for  n = 0, 1 L-  1 (161) 

Note  that  there  is  clearly  no  restriction  for  1 = 0,  and  that  these  conditions  are  automatically  satisfied  for  all  values 
of  L if  the  potential  is  even. 
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The  discovery  of  this  oroperty  is  due  to  Corbella  [ 1971  ] , who  showed  that,  for  this  special  class  of  potentials, 
it  is  possible  to  reduce  the  fc-wave  problem,  with  8 < £,  to  an  5-wave  problem  with  a new  potential  that  is  still 
holomorphic  at  the  origin.  The  process  of  reduction  implies  that  the  5-wave  phase  shift  and  bound-state  parameters 
of  the  new  problem  essentially  coincide  with  the  B-wave  phase  shift  and  bound-state  parameters  of  the  original 
problem  (see  below).  Then,  of  course,  the  results  of  Calogero  and  Degasperis  [ 1 968]  for  the  5-wave  case  can  be  taken 
over,  and,  in  principle,  the  complete  solution  of  our  problem  is  achieved  for  this  special  class  of  potentials  We  refer 
for  a detailed  ticatment  to  the  paper  by  Corbella  [1971] , limiting  discussion  here  to  a simple  outline  of  the  method 
and  results.  The  original  mathematical  trick  for  the  transition  from  a P- wave  problem  (8=1)  to  an  5-wave  one 
(8  = 0)  is  due  to  Calogero  [unpublished,  1970] ; its  development  and  use  is  due  to  Corbella  [1971  ] . 

The  basis  of  the  method  is  as  follows.  Let  y(r)  be  the  solution  of  the  differential  equation 


y"(r)+  [k2  - IVWjyfr)  = 0 


(162) 


and  introduce  a function  f(r)  through  the  Riccati  equation 


fir)  = f2(r)-W(r) 


Define  the  function  y (r)  through 


y(r)  = lXr)y(r)  + y'(r) 


(163) 


(164) 


Then  this  function  satisfies  the  new  differential  equation 


y >)+[*’ -*(')]}  to  = 0 


(165) 


with 

Hr)  = W(r)  + 2/'(r) 

as  can  be  verified  by  direct  substitution.  Assume  now  that 

W(r)  = 8(8  + l)r-3  + V(r) 

so  that  Eq.  (162)  becomes  the  8-wave  radial  Schrodinger  equation 


0"eto+  [k2  -8(8+  Dr"1  - Kto]0s(r)  = 0 


Then  set 


/eto 


r 


f8to 


(166) 


(167) 


(168) 


(169) 
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so  that  Eq.  (163)  becomes 


/gW  = £g(r)  + 2~~  g^(r)  - F(r) 


(170) 


while  Eq.  (164)  defines  a new  function 


«£WUe(r)  + <ig(r) 


(171) 


The  index  6-1  has  been  introduced  because  Eq.  (165)  now  reads 

0 g-iW+  -(6  - l)6r_J  - K(r)j  0e_l(r)  = 0 


(172) 


with 


Hr)  = V(r)-2g\(r) 


(173) 


Thus  the  function  <t>  g_(  (r)  satisfies  now  the  radial  Schrodinger  equation  appropriate  to  the  angular  momentum 
quantum  number  6-1,  with  the  new  potential  V {»-)  of  Eq.  (1 73),  the  function  gp(r)  being  defined  by  the  Riccati 
equation  (170). 

Repeating  this  procedure  6 times,  one  arrives  at  the  equation 

0(r)+[fc2-K(r)]«(r)  = 0 (174) 

with 


6 

V(r)  = V(r)-2Y  M (175) 

n=l 


The  functions  gn(r)  are  defined  by  the  recursive  Riccati  equations 


SgW  = $r)  + 2 y gt(r)  - V(r)  ( 1 76) 


g'n(r)=  S>)  + 2 — gn(r)-V(r)-2  V g'm(r)  n = 6-1, 6-2, . . . , 1 (177) 

r m=n+l 


Equation  ( 1 74)  is  the  5- wave  radial  Schrodinger  equation  with  the  potential  P ( r ),  and  it  is  easy  to  see  that  the 
5-wave  phase  shift  and  bound-state  parameters  corresponding  to  this  potential  coincide,  or  are  very  simply  related. 
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to  the  8-w?ve  phase  shifts  and  bound-state  parameters  produced  by  the  original  potential  V(r) . Indeed,  the 
explicit  relations  given  by  Corbella  [1971]  are 


= t|(*J 

(178) 

*8 

Ekn 

= £■« 

(179) 

II 

3o» 

(180) 

where  ih;  quantities  rjg(fc),  and  C^,  are  the  8-wave  phase  shifts,  bound-state  energies,  and_bound-state 
normalization  coefficients  for  the  original  problem  with  potential  V(r).  while  v(k_).  En  and  are  the  S-wave 
phase  shift,  bound-state  energies  and  normalization  coefficients  for  the  potential  Vlr). 


Itiseasy  to  prove  [ Corbella , 1 971  ] |hat  if  the  condition  (161  ) holds  and  if  the  original  potential  V(r)  is 
holomorphic  at  r = 0,  the  final  potential  V(r)  of  Eq.  ( 1 75)  also  is  holomorphic  at  r = 0 ( for  8 < 4 J-  Then  the 
results  of  Calogero  and  Degasperis  [1968]  are  applicable  to  the  S-wave  problem  with  the  potential  V (r),  so  that 
the  values  at  the  origin  of  this  potential  and  its  derivatives  ar^expressible  by  the  explicit  formulas  ( 145 j - ( 1 55) 
in  terms  of  the  phase  shift  i?(fc)  and  the  bound-state  data  En  and  Cn , which  are  very  simply  related  to  the 
corresponding  quantities  for  the  original  £-wave  problem  with  potential  V(r)  [see  Eqs.  (1  78)  through  (1 80)] . But 
the  definition  (175),  together  with  tlje  recursive  equations  ( 176),  (177),  imply  definite  relationships  between  the 
values  at  the  origin  of  the  potential  V(r)  and  its  derivatives,  and  the  values  at  the  origin  of  the  potential  V(r)  and 
its  derivatives.  This  is  the  case  only  for  values  of  8 such  that  8<4,  with  L defined  by  the  condition  (161).  In 
particular  Corbella  [1971]  gives  the  explicit  formulas 


V0 


% 

28+  1 


. = (%-  [28/(28+  1)](?>)2| 
2 ' (28-  1)(2B+  1)(28+  3)  | 


(181) 


(182) 


Vi  = 8=1 


V4  = 45 


| P4  - [48/(68  + 3)]  P0 K3  + [48(28  - l)/(68  + 3)2 ] ( K0)3  - (5/24)(K,  )3 

I 


(28  - 3)(2fi  - 1 )(2B  + 1)(28  + 3)(28+  5) 


V,  = - — 


Vt  = 


-1k--LPoP,~±fiP1+ 

2 ( 3 10  45  f 
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8=1 


8=2 


(183) 


(184) 


(185) 


(186) 
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where,  of  course,  Uw,isthe  nth  derivative  of  the  potential  K(r)  evaluated  at  the  origin,  and  Vn  is  the  nth  deriva- 
tive of  the  potential  V(r),  also  evaluated  at  the  origin.  The  validity  of  the  restriction  (161)  i«  always  assumed. 

These  equations,  together  with  the  correspondence  of  Eqs.  (178-  180)  and  the  results  of  Calogero  and  Degasp. ris 
[1968]  [see  Eqs.  (145  - 155)] , yield  explicit  expressions  of  Vn,  lor  n up  to  5,  in  terms  of  the  V-vave  scattering 
and  bound-state  data.  Note  that  tits  first  of  these  equations  reproduces  the  original  result  of  Faddcv  [1957]  and 
Newton  [ 1 956] , Eq.  ( 1 58). 

It  must  be  emphasized  that  the  explicit  expressions  of  the  potential  and  its  derivatives  at  the  origin  in  terms  of 
phase  shifts  and  bound-state  data  for  different  angular  momenta  imply  definite  integral  relationships  between  scatter- 
ing phase  shifts  and  bound-state  data  corresponding  to  different  angular  momenta.  One  consequence  of  these  relation- 
ships is  the  possibility  of  expressing  the  bound-state  parameters  in  terms  of  the  scattering  phase  shifts.  For  instance 
for  a potential  that  possesses  only  one  S-wave  bound  state  and  no  P- wave  bound  state  the  Newton-Faddeev  Fq.  (158) 
immediately  implies 


£ = 


(187) 


where  £ is  the  energy  of  the  S- wave  bound  state,  and  rj0(k)  resp.  77,  (&)  are  the  S-wave  resp.  £-wave  scattering 
phase  shifts. 


RESULTS  FOR  THE  KLEIN-GORDON  AND  DIRAC  EQUATIONS 


The  results  of  Calogero  and  Degasperis  [1968]  have  been  extended  to  the  case  of  the  Klein-Gordon  equation 
by  Degasperis  [1970] , and  to  the  Dirac  equation  by  Corbella  [1970] . The  interested  reader  is  referred  to  these  two 
papers  for  a clear  and  detailed  presentation,  which  cannot  be  given  here  as  it  requires  a previous  treatment  of  the 
Klein-Gordon  and  Dirac  inverse  scattering  problems.  The  results  obtained  are  interesting  per  se  and  constitute  prog- 
ress toward  a more  complete  solution  of  the  inverse  scattering  problem  for  Klein-Gordon  and  Dirac  equations  than 
had  been  previously  achieved. 

The  scattering  and  bound-state  problems  for  particles  described  by  the  Klein-Gordon  or  the  Dirac  equations 
present  some  new  features,  connected  with  the  relativistic  nature  of  these  equations.  The  most  important  is  the 
appearance  of  negative  energy  solutions,  which  are  interpreted  as  corresponding  to  antiparticles.  Another  charac- 
teristic feature  is  the  possibility  of  generalizing  the  basic  equations  introducing  two  potentials,  characterized  by 
different  transformation  properties  - one  behaving  as  the  fourth  component  of  a relativistic  fourvector  and  the  other 
as  a relativistic  scalar.  The  canonical  forms  of  the  Klein-Gordon  and  Dirac  equations,  however,  contain  only  the 
former. 

The  inverse  scattering  pioblem  that  has  been  solved  requires  as  input  data  the  scattering  phase  shift,  bound- 
state  energies,  and  normalization  parameters  for  one  value  of  the  angular  momentum,  both  for  particles  and 
antiparticles , it  yields  as  output  (through  the  solution  of  integral  equations)  two  potential  functions.  Th . require- 
ment thet  one  of  these  vanish  identically,  which  must  be  imposed  if  the  usual  form  of  the  Klein-Gordon  or  Dirac 
equat’on  is  being  considered,  clearly  implies  restrictions  on  the  input;  the  particle  and  antiparticle  data  are  then  not 
independent,  and  indeed  it  is  plausible  to  conjecture  that  the  data  for  the  particle  (or  the  antiparticle)  are  sufficient 
to  determine  the  potential,  and  therefore  also  to  determine  the  data  for  the  antiparticle  (or  for  the  particle).  However, 
no  definite  results  have  been  proved  so  far. 


5-63 


•M. 


The  results  obtained  by  Degasperis  [1970]  and  Corbella  [ 1970]  are  explicit  expressions  for  the  values  at  the 
origin  of  the  two  potentials,  and  their  first  few  derivatives,  in  terms  of  the  phase  shifts  and  bound-state  data  for  the 
particle  and  the  antiparticle  (for  S waves  in  the  Klein-Gordon  case,  for  S and  P waves  in  the  Dirac  case).  Then 
the  requirement  that  only  one  potential  be  present  - that  the  second  one  vanish  identically  •-  provides  explicit 
integral  relationships  between  the  scattering  and  bound-state  data  for  the  particle  and  for  the  antiparticle.  No  such 
relationships  had  been  previously  obtained,  and  they  constitute  a first  step  towards  a more  complete  understanding 
of  the  inverse  scattering  problem  for  the  usual  Klein-Gordon  and  Dirac  equations.  Indeed,  if  all  the  derivatives  of 
the  potentials  at  the  origin  are  considered,  the  requirement  that  all  vanish  for  the  second  potential  yields  an  infinite 
set  oi  integral  relations  between  the  scattering  and  bound-state  data  for  the  particle  and  those  foi  the  antipaiticle, 
which  might  be  sufficient  to  determine  one  set  from  the  other.  This  suggests  a procedure  to  prove  the  conjecture 
mentioned  above. 


FINAL  REMARKS 


We  have  reviewed  the  results  obtained  for  a rather  specific  problem  - that  of  obtaining,  in  the  framework  of 
quantal  potential  scattering,  explicit  expressions  for  the  values  at  the  origin  of  the  potentid  and  its  derivatives,  from 
a knowledge  of  the  scattering  and  bound-state  oata  (for  one  value  of  the  angular  momentum).  In  this  final  section 
we  briefly  discuss  the  phenomenological  applicability  of  these  results.  We  indicate  the  extensions  that  are  naturally 
suggested,  but  have  not  yet  been  accomplished,  and  we  mention  some  other  developments  that  fit  into  the  same 
philosophy  of  obtaining  more  explicit  results  than  those  yielded  by  the  standard  solution  of  the  inverse  scattering 
problem  (that  requires  the  solution  of  an  integral  equation). 

The  main  obstacle  to  the  phenomenological  applicability  of  the  results  reported  in  this  paper  is  the  need  to 
know  the  values  of  one  scattering  phase  shift  for  all  values  of  the  energy.  This  difficulty  is  common,  of  course,  to 
all  approaches  to  the  inverse  scattering  problem  that  take  as  input  the  data  for  a given  angular  momentum.  It  is, 
however,  particularly  relevant  to  the  approach  discussed  in  this  paper,  which  focusses  on  the  values  at  the  origin  of 
the  potential  and  its  derivatives,  because  these  features  of  the  potential  are  more  sensitive  to  the  information  con- 
tained in  the  behavior  of  the  potential  at  high  energy.  Indeed,  in  many  phenomenological  contexts,  the  high-energy 
behavior  of  the  phase  shift  and  the  short  range  behavior  of  the  potential  are  just  the  regimes  where  the  representation 
of  the  physical  problem  in  terms  of  a potential  is  less  reliable.  This  same  argument  can  also  be  used  to  justify  the 
relevance  of  this  approach;  for  the  explicit  relationships  that  we  have  discussed  constitute  the  more  appropriate  way 
to  evaluate  to  what  extent  the  phenomenological  data  determine  the  potential,  and  its  derivatives,  at  the  origin,  and 
thus  they  provide  a quantitative  basis  to  assess  the  consistency  with  phenomenology,  and  the  definiteness,  of  the 
description  of  the  interaction  in  terms  of  a potential,  in  the  neighborhood  of  the  origin.  A useful  tool  in  this  connec- 
tion is  the  possibility  of  obtaining  the  parameters  characterizing  the  asymptotic  behavior  of  the  phase  shifts  from 
their  overall  dependence  on  the  linear  momentum  k [see  Eqs.  (138  - 139)] . 

The  more  complete  results  have  been  obtained  so  far  for  the  case  of  nonrelativistic  S- wave  scattering  on  a 
holomorphic  potential,  as  discussed  earlier.  The  extension  of  these  results  to  potentials  that  have  an  r~'  singularity 
at  the  origin  (more  specifically,  potentials  such  that  rV(r)  is  holomorphic  at  the  origin)  is  closely  connected  with 
the  generalization  to  higher  partial  waves  [for  potentials  that  do  not  obey  the  restriction  of  Eq.  (161)] . A first  result 
in  this  direction  [Degasperis.  private  communication,  1967]  notes  that  the  F&ddeev-Newton  relation  ( 1 58)  remains 
essentially  valid  for  a potential  whose  expansion  at  the  origin  has  the  form 
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for  it  can  then  be  written  in  the  form 


V0  - V,  : 


r°° 

7o+—\  d*  k fo(*)-ij(*)] 
* Jo 


(189) 


provided  Tj(k)-r?(fc)  vanishes  at  infinity  sufficiently  fast  for  the  integral  to  converge.  Since  for  potentials  of  type 
( 1 88)  the  two  leading  terms  in  the  asymptotic  behavior  of  the  phase  shift  are  given  by  the  asymptotic  formula 


b0  In  k aQ 

m r~  * ~r  * 


/in  k\ 


(190) 


we  can  rewrite  Eq.  (189)  in  the  form 


f"° 

V0  l d*  [ki}(k)-b0\nk-a0\  =B0(a0,b0) 
o 


091) 


where  B0  is  now  an  universal  function  of  the  arguments  shown.  An  equivalent  way  to  write  this  formula  is 


vo~j  ["*  -^~]P  [*n(*)l  ■ B0  («o.  b0)  p = 2, 3, 4,  ■ 


(192) 


I ' 


The  choice  of  an  arbitrary  integral  value  of  p in  Eq.  (192)  is  justified  by  the  remark  that,  if  a function  vanishes 
sufficiently  fast  at  *= 0 and  as  k-**°,  then  by  partial  integration 


J*  *[-*  d*]/(*)=  J ikm 


On  the  other  hand,  the  need  for  the  integer  p in  Eq.  (192)  to  be  larger  than  unity  is  connected  with  the  asymptotic 
behavior  (190)  of  the  phase  shift. 

The  universal  function  B0{a0,  b0)  has  been  recently  determined  by  A.  Degasperis  [private  communication, 
1971].  Thus,  the  final  formula  for  V0  is 


V°  “"7  J d*]  ”*3* P * 2>3’4’ 


(193) 
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where  bQ  is  defined  by  Eq.  (192)  and  is  related  to  by  the  simple  equation 


V-,  - -2  b0 


0*4) 


These  equations  have  been  written  for  the  S- wave  case  without  bound  states.  The  inclusion  of  bound  states  presents 
no  problem;  nor  does  the  extension  to  higher  partial  waves.  Indeed,  these  recent  results  of  Degasperis,  together  with 
the  reduction  technique  discussed  earlier,  open  up  the  possibility  to  obtain  explicit  expressions  for  all  the  derivatives 
of  the  potential,  evaluated  at  the  origin,  in  terms  of  the  scattering  phase  shifts  and  bound-state  data  for  higher  partial 
waves,  even  if  the  potential  does  not  satisfy  the  condition  (161). 

The  possibility  of  treating  potentials  of  type  (188)  and  higher  partial  waves,  besides  being  interesting  per  se,  has 
a large  potential  for  application;  for  instance,  it  opens  the  possibility  of  dealing  with  the  nucleon-nucleon  scattering 
problem,  especially  in  the  context  of  the  so-called  one-boson-exchange  potentials  [Calogero,  1969] . (Note  that  the 
formula  for  V0  reported  in  this  reference  is  incorrect,  as  it  was  obtained  using  the  conjecture  that  the  universal 
function  B0  vanishes  when  no  bound  states  ate  present.) 

Another  generalization  that  would  considerably  widen  the  phenomenological  applicability  of  the  approach  is 
to  multichannel  potential  scattering  problems.  The  only  result  for  this  case  is  contained  in  the  original  paper  by 
Faddeev  [1957] ; it  relates  the  value  at  the  origin  of  the  trace  of  the  potential  matrix  to  the  generalized  first  momen- 
tum moment  of  the  trace  of  the  logarithm  of  the  scattering  matrix  (or  equivalently,  of  the  sum  of  all  the  scattering 
eigen  phase  shifts). 

As  mentioned  above,  the  explicit  expressions  for  the  values  at  the  origin  of  the  potential  and  its  derivatives 
discussed  in  this  paper  are  not  the  only  explicit  relations  between  the  potential  and  the  scattering  and  bound-state 
data  that  are  known.  Other  explicit  expressions  have  been  obtained  from  a study  of  the  asymptotic  behavior  of  the 
scattering  phase  shift,  which  yields,  for  example,  the  formulas  [Calogero  and  Degasperis,  1968] 

J dr  K(r)  = -2a0  (195) 

o 
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I drKJ(r)a-8fl>+4^c"+[^']  x J (196) 


where  the  coefficients  an  arc  v.cs*e  characterizing  the  asymptotic  behavior  of  the  S- wave  scattering  phase  shift  17(A) 
through  Eq.  (138)  (and  are  there! 01  c expressed  explicitly  in  terms  of  rj(it)  by  Eq.  (139)  ),  the  modulus  of  the  Jost 
function  I f(k)  I is  explicitly  expressed  in  terms  of  the  phase  shift  and  the  bound-state  data  by  Eqs.  (143-144),  and 
Cn  arc  the  normalization  constants  of  the  S- wave  bound  states  (defined  by  Eq.  (15)).  However,  as  exf ' ’ined  by 
Calogero  and  Degasperis  [1968] , it  is  not  possible  to  obtain  other  simple  expressions  of  this  kind,  in  fact,  the  next 
expression  of  this  type  that  could  be  explicitly  related  to  the  scattering  and  bound-state  data  has  the  form 


r r 

j drK»+yl  drlK'OOl*  (197) 

o o 

namely,  it  contai  » the  potential  in  a rather  complicated  way. 
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Other  relations  between  the  potential  and  the  scattering  and  bound-state  data,  in  which  energy  moments  of  the 
phase  shifts  of  negative  order  enter,  have  been  obtained  by  Roberts  [1964a] . These  relationships,  however,  do  not 
involve  the  potential  in  an  explicit  way. 

Another  relevant  result  given  by  Calogero  et  al  t i 968]  equates  a simple  functional  of  the  potential  to  an  explic- 
it variational  expression  in  ter. ns  of  the  scattering  and  bound  state  data.  A simple  conseri..t-',ce  of  this  result  is  the 
following  variational  bound  for  the  Laplace  transform  of  lb*  potential 


J 

* n 


dr  K(r)exp(-2pr)>  -J  Cnl(p2+En) 


+ 


rr 


dq[qil(q2+pi)][\-  \j{q)\  ’] 


(198) 


where  the  only  restriction  on  the  positive  constant  p is 


P 


>(-En) 


(199) 


and  the  other  symbols  have  the  usual  meaning  (see  above). 

An  appealing  direction  for  research,  consistent  with  the  philosophy  of  searching  for  explicit  relationships 
between  the  potential  and" the  scattering  and  bound  state  data,  but  implying  more  radical  modifications  in  the 
approach,  takes  as  point  of  departure  a kind  of  information  different  from  that  contained  within  a single  scattering 
phase  shift  (assumed  known  for  all  energies)  and  the  corresponding  bound-state  parameters.  An  instance  of  this  kind 
is  provided  by  the  results  of  Buslrev  [1962, 1966] , who  worked  in  terms  of  the  full  scattering  amplitude  (assumed 
known  for  all  eneigies),  alth  of  course,  as  mentioned  in  the  first  section,  for  a spherically  symmetrical  potential 
once  the  full  scattering  amplitude  is  known,  the  partial  wave  scattering  amplitudes  e\p[h/g(fc))  sin  [r/g(<c)]  and 
therefore  the  scattering  phase  shifts  rjg(A)  also  are  known. 

Finally,  a very  interesting  direction  of  research  is  the  attempt  to  apply  the  techniques  described  here  to  the 
quantal  three-body  and  many-body  problems.  Practically  no  results  for  the  many-body  problem  have  been  obtained 
so  far,  besides  those  of  Uhlenbeck  and  Beth  [1936] , consisting  of  an  exploitation  of  the  trace  approach  in  the  con- 
text of  statistical  mechanics  (see  also  the  more  rec  it  and  complete  contribution  by  Berezin  [1964] ) Results  for 
the  three-body  problem  that  appear  very  interesting  have  been  recently  announced  by  Buslaev  ard  Merkurev  [1970] . 


4 
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APPENDIX  A 


In  this  appendix  (that  coincides  with  Appendix  D of  the  paper  by  Ctdogero  and  Degasperis  [1968) ),  we  collect 
some  useful  operator  and  integral  formulas. 

First  of  all  we  introduce  the  operator 


D = k-2-k 
d * * 


(M) 


and  we  note  the  identity 


' (‘5  *)'  ‘ *■©** 


(A2) 


which  is  easily  proved  by  induction. 

A remarkable  property  of  the  operator  D is  expressed  by  the  following  formula: 


D" 


fm* ~m  = 0,  if  n>N, 


= (-)" 


(m-1)! 


if  n <N. 


(A3a) 

(A1’ 


We  then  consider  a function  f(k)  whose  asymptotic  expansion  (k  -*■  ±«)  is  of  the  following  type 


N 

/(*)  = k~P  ^ fnk~2n  + 0(k~P~2 Ar-2)>  p s o or  1.  (A4) 

n=l-p 


We  also  assume  that  f(k)  and  all  its  derivatives  are  finite  on  the  real  axis,  jid  that  the  asymptotic  behaviors  of  the 
derivatives  of  f{k)  may  be  obtained  differentiating  Eq  (A4). 

For  functions  of  this  type  we  write  the  integral  identity 


J 


+oo 

d*  *2n+P  f(k)-k~P 


m=l-p  00 


(A5) 


l'he  validity  of  this  rel*  ■ is  easily  proved  by  partial  integration  and  using  Eqs.  ( * 3a)  and  (A4).  Of  course  the  value 
of  p in  Eqs.  (A4)  and  (noj  must  be  tne  same. 
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Another  useful  formula,  whose  proof  requires  only  one  extra  partial  integration,  is  the  following: 


>.2n+p+\  j 
d k 


\m-k-p 


= (_)P+)(2n+PlU 

(2n+p)!  J ^ 


(A6) 


Again  the  value  of  p must  be  the  same  as  in  Eq.  (A4). 

It  is  easily  seen  that  Eqs.  (AS)  and  (A6)  remain  valid  if  the  integral  runs  only  over  one  semiaxis,  if  the  function 
f(k)  has  2 definite  parity,  so  tha  the  integrands  in  Eqs.  (AS)  and  (A6)  are  even. 

Finally  we  write  integral  expressions  for  the  asymptotic  coefficients  fn  themselves: 


fn  = H**1-:  -1—  f dl t-U2"+P/(*) 

(2/i+p-l)!  J k 


(A7) 


These  are  easily  proved  by  partial  integration  and  using  Eq.  (A3a).  Again  the  value  of  p must  be  the  same  as  in 
Eq.  (A4).  Note  that  the  integrand  in  the  right-hand  side  of  this  equation  is  an  odd  function  of  k,  if  f(k)  has  the 
same  parity  of  its  arymp*  , ' ' expansion. 
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APPENDIX  B 


In  this  appendix  (that  coincides  with  Appendix  C of  the  paper  by  Calogero  and  Degaspem  [ 1 968] ),  we  report 
for  completeness  an  explicit  elementary  derivation  of  the  well  known  formula,  Eq.  (72).  that  relates  the  derivative  of 
the  ( 5-wave)  scattering  phase  drift  r?(A)  produced  by  the  potential  K(r)  to  die  increment  nj(E)  in  the  density  of 
states  due  to  the  same  potential. 

We  enclose  the  system  in  a sp.  erica!  box  of  radius  R.  In  the  absence  of  potential  the  allowed  states  are  charac- 
terized Sy  the  wave  functions  sin  (krr),  with 


knR  = nir 


(B!) 


Thus  the  density  of  states  in  this  case  is 


n0(£) 


An 
A E 


A n 
2 k Ak 


R 

Ink 


(B2) 


On  the  other  hand,  if  the  potential  is  present  and  if  R is  large  enough  so  that  the  asymptotic  expression 
sin  [kR  + rj(Jt)]  may  be  used  in  place  of  the  radial  wave  function,  then  in  place  of  Eq.  (Bl)  we  have 


knR  + ij  (kn)  = nt 


(B3) 


Here  of  course  t)(*)  is  the  phase  shift  due  to  the  potential.  Thus  the  density  of  states  in  this  case  is 


ME) 


An  _ 1 ^ +_Aq 

A E Ink  A * 


(B4) 


But  by  definition 


«/(£)  = «(£)-«o(£) 


(B5) 


and  inserting  in  this  equation  the  expressions  (B2)  and  (B4)  we  get,  in  the  limit  of  the  continuum  (R  -*•  °°,  A i:  •*  dk), 
Eq.(72),q.e.d. 


APPENDiX  C 


This  appendix  coincides  with  appendix  E of  the  paper  by  Calogero  unj  Degaspens  [1968] . In  it  we  investigate 
the  expression 


Si 


/*+0°  f*00 

— I dr  I dk  txp(ikr)[(T+  Vf  - 7P]exp(iftr) 

«/—oo  »-oo  % 


(Cl) 


where 


T = - 


_d^_ 

dr2 


(CT> 


and  K is  a holomorphic  function  of  r which  vanishes  sufficiently  fast  at  infinity  to  allow  any  number  of  integrations 
by  parts  with  neglect  of  the  contributions  at  both  extremes. 


We  begin  evaluating  tae  integral 


(C3) 


>•  +OC  pr  * +oo  ^+oo 

— I dr  I dk  exp(ikr)f(r)  Tg(r)  cxp(iKr)  I dr  I dk  exp(2ikr)f(r)^-g"(r)-2ikg'(r)+k3g(r)J 

^ OO  J—ao  J—  oo  ^ — oo 


where  /(r'  and  g(r)  are  arbitrary  functions  which  vanish  asymptotically  together  with  their  derivatives.  We  use  the 
identities 


and 


2ik  exp(2ikr)  -”p(2iikr) 

dr 


, 1 d 

* exp(2 i*r)  = - — — exp(2i*r) 


(C4r.) 


(C4b) 


to  eliminate  the  terms  linear  and  quadratic  in  k in  the  right-hand  side.  We  then  integrate  by  parts  to  eliminate  the 
derivatives  acting  on  the  exponential  and  we  thus  secure 


» + 05  ~ +00 

H.  ■[ 


dk  exp (ikr)f(r)Tg(r)  exy(ikr) 


(C5) 


y*+oo  y-  -f  pc 

= ~~  l ^ I d*exp(2i*r)[2/(r)gV)-/,'(»’)g(r)-/(r)g'’(r)] 

»-«  OO 


5-71 


Finally  using  the  expression 


Hr) 


■if  * 


exp(2»fcr) 


where  5 is  the  Dirac  distribution,  we  obtain 


/*+oo  />+» 

yl  *■  J d*  exp (ikr)f(r)  Tg(r)  exp(ifa-)  = ~ [2 f(r)g\r)  ~f"(r)g(r)  ~/(r)g”(r)] J 

•T— OO  OO 


It  is  convenient  at  this  point  to  introduce  the  two  differential  operators  d and  d , defined  as  follows: 


/(r)d  gir ) » f(r)g‘(r) 


f(r ) d g(r ) = f'(r)g(r) 


We  nay  then  rewrite  Eq.  (C7)  in  the  form 


J*  TOO 

~ J d*  exp(i*r)/(r) Tg(r)exp(ikr)  = ~/(r)  (?-d)*  *(r)|  q 

«^-00  oo  ' 


On  the  other  hand  we  also  have,  for  an  arbitrary  function  /(r) 


/i+oo  yw-t-oo 

tf  */  • 

W-OO  oo 


exp(i*r)/(r)exp(»*r)  - /( 0) 


From  this  equation  a.id  Eq.  (C9)  we  infer  that 


« -foo  f*  +00 

jf  dr  j to  txp(ikr)  j(  7>  V)p  - 7*]  exp(i*r)  = |l  • [K(r)- ^-(?-d)3]P  • 1^_q 

• ■"  OO  op 


It  is  interesting  to  note  a remarkable  property  of  the  operator  J + <f,  namely 

fir)  (?+d)g(r)  * 1 f(r)g(r)] 

dr 
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(C6) 

(C7) 

■ 

(C8a) 

(C8b) 

(C9) 

(CIO) 

(Cll) 

(C12) 


Thus  the  operator  d + d commutes  with  a function  of  r,  namely  its  position  within  a product  is  irrelevant: 


(d  +hf(r)g(r)  = f(r)  (d  + d)*(r)  = f(r)g(r)  (d  + d ) (C13) 

4— 

This  is,  unfortunately,  not  the  case  for  the  operator  d -d . 

We  may  use  this  property  of  the  operator  A + <T  to  express  the  right-hand  side  of  Eq.  (Cl  1)  in  the  form 

|l-|V(r)-i(d-d)2lP-l[=^  (f)(-)82“2«  ^ |l-(V(r)  + d-d]p_C-l}  (C14) 
( ‘ 8=o 

which  is  obtained  using  the  identity 

(d-d)  = (d  + d)  - 4d  • d (CIS) 


and  the  binomial  expansion,  ^'hich  is  applicable  tha.-.ks  to  the  commutativity  of  d+d.  This  expansion  may  be 
useful  because  the  operator  d • d is  such  that 


1*  [K(r)+d -d]P-l  = K(r)[K(r)  + ? • jjP  ^ K(r),  p = 2.  3.  • • • ; (C16) 


moreover  this  term  contains  derivatives  only  up  to  the  order  p-2.  On  the  other  hand  the  quantity  in  the  left-hand 
side  cf  Eq.  (C14)  contains  derivatives  up  to  the  order  2p-2.  Note  that  from  Eqs.  (C14)  and  (C16)  one  easily  gets 


i.[k(,)-|(J-S),]',-i  - (-f“P2'-r  I'M 

+ terms  containing  derivatives  of  order  2p-4  or  less 


217) 
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APPENDIX  D 


In  this  appendix  we  derive  explicit  expressions  for  the  coefficients  gn  of  the  asymptotic  expansion  of  g(k), 
in  terms  of  the  (5-wave)  scattering  phase  shift  r<(k).  The  treatment  given  here  is  a simplified  version  of  appendix  G 
of  the  paper  by  Calogero  and  Degasperis  [1968] , the  simplification  resulting  mainly  from  the  assumption  that  no 
bound  states  are  present.  We  recall  that  this  assumption  implies  that  the  Jost  function  /(£),  and  the  function  g(k) 
of  Eq.  (24),  are  both  holomorphic  in  the  lower  half  of  the  complex  /c-plane. 

We  consider  separately  the  expansions  of  the  real  and  imaginary  parts  of  g(k),  since  Eq.  (D17)  obviously 
implies,  for  real  k , 


Reg(k) 


(-)"+l  (2 kT**-1  + Oik-*"-*) , 


(Dl) 


lmg(k) 


gin  (-)”  (2 + 0(k~2"-3) 


(D?) 


We  treat  first  the  imaginary  part.  Let  us  recall  that  the  explicit  expression  of  this  quantity  for  real  k is 


Im  g(k)  = k { 1 - exp  [-2A(*)]f 


(D3) 


where 


Aik) 


ir 


f-k7 


via) 


(D4) 


It  is  convenient  to  discuss  first  the  asymptotic  expansion  of  Aik),  that  was  introduced  earlier: 


A(*)  = 2^  d „ k~*n  + Oik-*"-*) 


(D  5) 


We  now  prove  that 


dk&P-1  nik) 


(D6) 
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This  formula  has  been  quoted  from  the  paper  of  Buslaev  and  Faddeev  [I960] . The  operator  D is  that  defined  in 
Appendix  A. 

Beiore  providing  a proof  of  this  formula  we  show  how  it  may  be  obtained  by  a less  rigorous  but  more  straight- 
forward approach.  In  fact  if  we  substitute  in  Eq.  (D4)  the  asymptotic  expansion 


tf-k2)-' 


N 

q2P~2  k~2P  + 0(k~2^~2) 

p=l 


(D7) 


we  get  formally 


A (k)  = 


J d qq2P  1 q(q)  + 0(k  2N  2) 


(D8) 


which  implies  Eq.  (D6)  because,  by  partial  integration  (see  Appendix  A) 


-f—1  f 


dk  D2P~l  n(k) 


■ f 


dk  k2 


m 


(D9) 


Of  course,  the  last  two  equations  need  not  make  sense,  because  the  integrals  in  the  right-hand  side  generally  do  not 
converge.  Both  equations,  howevei,  become  valid,  provided  an  appropriate  part  of  its  asymptotic  expansion  is 
subtracted  from  the  function  v(k),  so  as  to  make  the  integrals  convergent.  Specifically  in  place  of  77(A)  we  must 
substitute,  in  both  equations. 


P~  1 

T?(*)  - £ ank~2n~'  (DIO) 

n=  0 


where  the  coefficients  an  are  those  of  Eq.  (33).  It  has  been  proved  in  Appendix  A that  with  this  substitution, 
Eq.  (E>9)  holds  true.  There  remains  now  to  show  that  with  this  substitution  also  Eq.  (D8)  becomes  correct. 

To  prove  this  result  we  introduce,  after  Roberts  [19646] , the  function 


h(k) 


d 

d* 


In  f(k)  = 


df(k)/dk 

m 


dA  (k)  drj(A) 

+ i 

dk  L dk  J 


(Dlla) 


(Dllb) 
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and  its  asymptotic  expansion 


'..-  ' l 

:.-....  ■ 1 
. - » 


*(*)  = » 2 (-2i*)-^Aw  + 0(fc-^-3) 
n=0 


Equation  (D1  lb)  follows  from  Eqs.  (21  - 22).  A comparison  of  this  expansion  with  those  for  A(k)  and  17(A), 
Eqs.  (DS)  and  (33),  implies  theough  Eq.  (D1  lb)  the  relations 


dp  « (-)P— 2~JP-5  hJp_, 


On  the  other  hand,  integrating  the  function 


F(k)  = k*P  h (k)  - i £ ^ 1 (-2/A)-”-* 
n= 0 


along  a contour  composed  of  a large  semicircle  in  the  lower  half  fc-plane  and  of  the  real  axis  idented  at  k-  0,  and 
using  the  fact  that  the  function  h(k)  is  holomorphic  in  the  lower  half  plane  (because  f(k)  is  holomorphic  and  it  has 
no  zeros),  one  finds 


if*0  d " P-1 

-)P hip.x  = I d**Jp—  v(k)~  £ an i 

) -00  n- 0 


From  this  equation  and  Eq.  (D13)  we  get 


p- 1 

2"”  * U, 


This  equation,  by  Eq.  (A6),  becomes  Eq.  (D6). 

There  remains  now  to  express  the  quantity  gip  in  terms  of  the  coefficients  . To  achieve  this  aim  we  must 
first  solve  the  following  algebraic  problem:  let 
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z(x)  = explv(x)] 


<018) 


y(x)  = 


z(x)  = 1 + ^ z„x" 


and  express  Zp  in  tern’s  of  yn . 

The  solution  of  this  problem  can  be  found  noting  that 


the  parenthetical  superscript  indicating  the  order  of  differentiation,  and  using  the  formula  for  the  differentiation  of 
a composite  function,  namely. 


V'  dm 

F[y(x)]  — -Fb(x)] 


* 

j~~|  fly  fr)(x  )//!]*/ 


•E  •/ 


and  the  sum  ir  extended  over  all  positive  integral  values  of  8 and  Sg  and  nonnegative  integral  values  of 
Sj,  i=  1,2,..  P - 1 , such  that 
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(026) 


the  nonnegative  values  of 

(D27) 

(D28) 

(D29) 

(D30) 

(031) 

(D32) 

(D33) 


the  turn  being  extended  over  all  positive  integral  values  of  8 and  s j and  nonnegative  integral  values  of 
1,2,...,  8-1  such  that 


£ 


/*/ 


p+  l 


(D35) 


Note  that  this  last  equation  implies  that  the  maximum  value  of  8 in  Eq.  (D34)  cannot  exceed 


o 

'max 


p+  1 


(D36) 


Let  us  proceed  now  to  discuss  the  odd  coefficients  g2p+i . namely  those  appearing  in  the  asymptotic  expan- 
sion of  Re  g(k ) for  ii.al  k . We  use  again  the  technique  which  we  used  above,  except  for  the  fact  that  we  now  inte- 
grate the  function 


F{k) 


k2p+\ 


2p+l 

*<»>-£ 


>1=0 


(D37) 


The  contour  is  the  same  as  in  the  previous  case;  ar  1 F ( k ) is  again  holomorphic  in  the  lower  half  plane.  Considering 
only  the  imaginary  part  of  the  result  we  secure  in  this  manner 


#2p+i 


= (-)P+1 


2 7P+i 

it 


f +oo 

p 

I dk  k2P+l 

Im  g(k)  ~ £ (-)"ft„( 2kT2n  x 

■'-oo 

n= o 

which,  using  Eqs.  (25)  and  (A5),  can  be  written  in  the  form 


(D38) 


£ip+i  = 


[l-l/Wi-2] 


[*-'  I /(*)!"’] 


(D39a) 


(D39b) 


The  second  line  of  this  equat.on  is  obtained  from  the  first  by  an  integration  by  parts. 

Of  course  this  equation  might  have  been  inferred,  in  a straightforward  but  nonrigorous  way,  expanding  directly 
under  the  integral  sign  in  Eq.  (28),  as  was  done  at  the  beginning  of  this  Appendix  to  obtain  the  coefficients  dp. 
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DISCUSSION 


Unidentified  speaker:  It  there  a v ay  of  relating  mare  directly  these  coefficients  in  the  asymptotics  of  eta  to  the 

potential? 

Calogero:  Yes.  Tliis  is  a question  concerning  the  direct  problem.  There  are  explicit  formulas  for  that. 

Grossi:  Would  it  be  possible  for  a monochromatic  particle  to  express  some  of  the  functions  that  lead  to  the  V0 

and  its  derivative  as  a function  of  the  input  parameter  of  the  particle. 

Caiogero:  This  is  the  other  problem,  the  problem  at  fixed  energy  and  varying  angular  momenta.  I have  not 

looked  into  this. 


THE  INVERSE  SCATTERING  PROBLEM  AT  FIXED  ANGULAR  MOMENTUM 
FOR  NONLOCAL  SEPARABLE  INTERACTIONS 
K.  Chadan 

Laboratoire  de  Physique  The'orique  et  Parvicules  Ele'mentaires 
Complexe  Scientifique  d'Orsay,  Universite  de  Paris  - Sud 

ABSTRACT  N78“116H 


As  is  the  major  thrust  of  Newton’s  paper,  the  problem  of  inverse  scattering  at  fixed  angular  momentum  is 
considered.  The  oroblem  is  particularized  to  the  case  of  nonlocal  separable  interactions. 

This  paper  is  a brief  survey  of  the  inverse  problem  for  nonlocal  separable  interactions.  As  we  shali  see,  this 
problem  can  be  solved  exactly  by  integration.  In  fact,  it  amounts  to  solving  singular  integral  equations  of  the 
Hilbert-MushkhelishviP  type,  which  have  been  studied  extensively  in  the  past  and  appear  in  many  areas  of  physics, 
including  theory  of  elasticity  and  dispersions  relations  in  high  energy  pt  vsics. 


INTROr  JCTION 


The  Schrodinger  equation  for  the  scattering  of  a nonrelativistic  particle  of  mass  M by  a general  nonlocal 
interaction  reads: 


(Ar  + £)y'.-fk,r)  = ju(  r ,:')^(k,r’)dr’ 


(1) 


where  we  have  chosen  the  units  snch  that  r.  ~ 2 M - 1 , E = 1 2 k1 12M  ~k},k  being  the  wave  numoer.  We  assume  now 
that 


E 


(2) 


r * lr  I r = !r'  cos (r  =— — e = ± 1 


rt 


We  sl.’U  study  later  separable  interactions  of  a more  general  k>nu.  Making  the  usual  partial  wave  decomposition 


\Kk  ,r)  Pg  [cos  tk . s »] 


(3) 
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we  obtain  the  reduced  radial  equation 


*e(*.r)  = e£t/8(r)  I *g(*''W)dr' 


U^(r)  = v^T  ru^r) 


(4) 


to  which  we  ha’-e  to  add  the 1 ».  .dary  condition  t|/g(k,0)  = 0.  The  asymptotic  form  of  \jj%  at  large  distances  should 
be 


*8i 


^48  sin  kr  --j  Cir  + 6g(t) 


(5) 


where  5^vic)  is  the  phase  shift. 


i 

i 

i 

5 


In  each  angular  momentum  state  - that  is,  for  each  8 — the  inverse  problem  consists  of  finding  the 
“potential”  f/{(r)  from  the  knowledge  of  6 j(J t),  which  is  assumed  to  be  known  for  all  positive  energies,  and  the 
energies  of  the  eventual  bound  states  (i.e.,  square-integrable  solutions  of  Eq.  (4).  fhese  later  are  divided  into  two 
groups:  those  with  negative  energy,  and  those  with  positive  energy. 

For  simplicity,  we  shall  consider  from  now  on  the  case  of  S- wave  (8  = 0).  All  the  results  can  immediately  be 
generalized  to  the  case  of  higher  waves,  as  we  shall  see.  Let  us  therefore  look  first  at  scattering  solutions. 


SCATTERING  SOLUTIONS 


We  have  to  solve 


= cN(k)U(r ) E = k*  e * e0 


U(r')Hk-r)ir' 


(6) 


This  equation  can  easily  be  solved  by  taking  the  Fourier  sine  transform* 


Hk.r ) 


sin  pr 
P 


dr 


Hk.p) 


sin  pr  , 

-£-ps 


dp 


(7) 


(8) 


•The  Fourier  transform  of  the  wave-functions  is  not  a function  but  a distribution.  However,  there  is  no  problem  in 
handling  such  harmless  objects.  We  use  only  Fourier  sine  transform  since  (k, 0)  = 0. 
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One  then  immediately  finds  from  the  asymptotic  form  of  \p  that  the  phase  shift  is  given  by 


tg6(k) 

n 


F(k) 

1+pI  F(p)i(p'  - k2)dp 


I 


Fik)  = — ek2[U{k)\ 3 

jt 


(9) 


(tO) 


where  P denotes  the  principal  value  of  the  integral,  and  U 


is  the  Fourier  sine  transform  of  the  potential: 


U(k)  = 


dr 


(111 


As  it  is  obvious  now,  we  have  to  impose  some  integrability  conditions  on  the  potential  in  order  for  the  above 
formulas  to  be  valid  It  is  easily  seen  that  the  following  rather  general  conditions  are  sufficient. 

U is  locally  integrable,  except  perhaps  at  the  origin, 


JV-. 


(J(r)  I dr<°° 


i U(r)  I dr  <°° 


(12) 


(13) 


where  rj  and  tj'  are  as  small  as  we  wish,  but  positive.  (We  use  these  symbols  to  denote  small  positive  quantities. 
Their  actual  values  may  differ  from  place  to  place.) 

Condition  (12)  ensoures  that  the  boundary  condition  is  satisfied  and  that  the  integral  appearing  in  the  right- 
hand  side  of  the  radial  Schrodmger  equation  is  convergent  at  its  lower  extremity,  whereas  conditions  (12)  and  (13) 
together  guarantee  the  existence  cf  this  integral  as  a whole,  the  existence  of  U(k)  for  all  energies  (fc>0),  and  the 
convergence  at  infinity  of  :!,e  integral 


f F<  p) 

Gik1)  = P | -r^TT  dP  04) 

J P k 

0 y 

which  appears  in  the  denominator  of  Eq.  (9). 

We  notice  now  that  si  we  > U(r)E  L t [0,  °°]  ,k(J(k)  is  everywhere  continuously  differentiable.  (One  can  dif- 
ferentiate Eq.  (1 1)  under  tne  integral  sign.)  Therefore,  the  principal  vaiue  integral  (14)  is  meaningful  and  defines  an 
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V' 

.1. 

-7 


everywhere  Holder-continuous  function  with  Holder  index  1 - e,  e as  small  as  we  wish;  in  fact,  one  knows  that 
IG(£  + h ) - G(E)  I ■ 0(  Ih  llog  \h  1).  Even  more  can  be  said  about  the  integral  (14).  It  can  be  shown  that  F(k), 
for  k-*°°,  behaves  like  k1-Tl.  where  tj  is  some  positive  quantity.  It  then  follows  that  the  principal  value  integral 
vanishes  when  k As  a consequence,  we  find,  according  to  Eqs.  (9)  and  (10)  that  tg  6(°°)  = 0;  and  for  k large 
enough,  the  sign  of  tg  6 (k)  is  that  of  -e. 


BOUND  STATES 


We  now  look  at  the  square-integrable  solutions  of  the  radial  equation.  First,  it  can  be  shown  that  the  bound 
states  with  positive  energy  are  characterized  b>  the  simultaneous  roots  of 


F(k)  = 0 


(15) 


1 +P 


f 


F(P ) 

( p'-k J) 


dp  =0 


(16) 


and  that  their  number  is  finite.  Equation  (16)  is  the  eigenvalue  equation,  wh'ca  also  gives  the  energy  of  the  true 
bound  state  (me  bound  state  with  negative  energy),  whereas  Eq.  (1 5)  ensures  that  the  radial  equation  has  one  solu- 
tion that  vanishes  in  the  limit  of  r -*•  <*>.  For  such  energies,  there  is  also  the  scattering  solution,  and  these  two  solu- 
tions are  independent  of  each  other.  Moreover,  it  can  easily  be  seen  that  at  the  energies  of  these  bound  states  the 
phase  shift  goes  through  nn,  with  n an  integer  (n  < 0).  Indeed,  it  is  clear  from  the  definition  (10)  that  the  roots 
of  Eq.  ( 1 5)  are  of  even  multiplicity  (at  least  of  order  2).  Therefore,  if  k0  is  such  a root,  we  have 


3 

3 k 


+ P 


I 


( P2-k 3) 


dp 


= 2k, 


k=kD 


I 


F{P) 


CP2  -*o2)2 


dp  0 


(17) 


where  we  have  written  F(p)  * F(p)  - F{k0).  As  is  obvious  from  (10),  this  quantity  has  the  sign  of  e.  The  last 
integral  is  meaningful  since  F(p)  has  a zero  of  order  at  least  two  at  p = k0  ■ It  follows  that  if  the  roots  of  Eqs.  ( 1 5) 
and  (16)  coincide  the  root  of  the  latter  is  simple.  From  the  definition  (9)  i*  follows  immediately  that  at  these  ener- 
gies the  phase  shift  crosses  the  value  nt.  Moreover,  it  does  so  downward  as  is  seen  by  differentiating  Eq.  (9),  and 
using  the  fact  that  e (17)  >0.  Once  the  phase  shift  has  crossed  a value  nn,  it  can  never  cross  it  again.  Of  course, 
it  can  reach  again  the  same  value  nn  at  some  other  energy,  but  this  happens  when  only  the  numerator  of  Ec  (9) 
vanishes,  and  so  6,  which  has  a zero  slope  at  this  energy,  does  not  cross  the  value  nn. 

As  for  the  bound  state  with  negative  energy,  it  is  given  by  the  root  of  the  eigenvalue  equation  (E  * -x) 


//( x)  = 1 + 


f 


F(P) 
ps  +x 


dp 


x>0 


(18) 


It  is  now  clear  that  there  is  no  true  bound  state  if  e * 1 , since  in  that  case  F{p)  is  positive.  We  therefore 
may  call  such  separable  potentials  repulsive.  However,  when  e»-l,  and 
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(19) 


H( 0)  * 


1 + 


dp  <0 


it  is  obvious  that  Eq.  (18)  has  one  positive  root  corresponding  to  a bound  state  of  energy  -x0. 


Figure  1 summarizes  typical  behaviors  of  the  phase 
shift.  Because  of  tg  6(“- ) = 0,  we  can  always  choose  8 
such  that 

5(®o)  = 0 (20) 

We  adhere  to  this  determination  from  now  on.  Therefore, 
the  Levinson  theorem,  which  relates  8(0)  - 8(°°)  to  the 
total  number  of  bound  states,  reads 


8(0)  = Nit 

(21) 

N = v + n 

(22) 

where  v is  the  number  of  bound  states  of  positive  energy, 
and  n is  the  number  of  true  bound  states.  We  leave  out 
the  possibility  of  a resonance  at  zei ' energy.  If  this  hap- 
pens we  must  add  ir/2  to  the  r.h.s.  of  Eq.  (21).  Since 
the  true  bound  state  occurs  only  when  the  sign  of  the 
potential  is  e = - 1 , and  since,  as  we  sav/  before,  the  sign 
of  6(Jfc)  for  sufficiently  large  k is  opposite  to  that  of  e, 
it  follows  that  >f  the  interaction  admits  a true  bound 
state,  6(Jt)  must  be  positive  (small)  when  k is  large. 
Therefore,  just  by  inspecting  the  sign  of  the  phase  shift  at  l 


rigure  la.  and  Figure  lb. 
energies,  we  can  determine  e. 


THE  INVERSE  PROBLEM 


To  determine  U(r)  from  the  phase  shift,  we  must  first  assume  that  the  latter  meets  all  the  requirements  for 
the  problem  to  have  a solution.  For  instance,  if  the  phase  shift  crosses  upward  the  value  tin,  n integer,  when  we 
increase  k,  we  are  sure  that  out  problem  with  one  separable  potential  has  no  solution. 

We  consider  first  the  inverse  problem  for  the  case  of  e = 1 . As  we  saw  before,  8(fc)  at  large  energies  must  be 
small  but  negative.  The  eventual  bound  states  with  positive  energy  are  then  determined  by  iust  looking  when  8 
crosses  (upward  !)  the  values  nit,  n = 0, 1, 2,  • • • , as  we  decrease  k from  infinity.  At  k * 0,  we  must  have 
6(0)  = vir.  To  find  the  potential  from  the  phase  shift,  we  have  to  solve  the  integral  equation  (9)  for  F(k).  Once 
this  quantity  is  found,  there  is  no  problem  for  finding  U(k)  via  Eq.  (9),  and  then  undoing  the  Fourier  transform 
(11).  For  simplicity,  we  consider  the  case  v * 1.  We  may  write  Eq.  (9)  in  the  form 

«.)  • I ♦ - f”  i,  (23) 

* JL  y-x~‘0 
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where 


We  assume  now,  according  to  our  previous  analysis,  that  5 is  Holder-continuous  with  some  positive  index  and 
that  it  behaves  like  k~^,  tj  some  positive  quantity,  when  *-+«•.  These  conditions  are  necessary  if  we  wish  to  find 
a potential  satisfying  Eqs.  (12)  and  (13).  They  are  also  sufficient  according  to  what  follows. 

To  solve  the  integral  equation  (23),  we  consider  first  the  function 


H(z)  - 1 + 


if 

*'0 


** 0>)»0‘) 

y-t 


iy 


(29) 


Assuming  a priori  the  Holder-continuity  of  $ and  the  proper  convergence  of  the  integral,  we  see  that  H(z)  is 
analytic  in  the  z plane  cut  from  0 to  +°°  (fig.  2).  Moreover, 


. /**)=!  (30) 

IZl-*oo 


in  all  directions. 

From  die  integral  Eq.  (23),  we  see  that  its  solution 
is  necessarily  of  the  form 


*(*)-  tf(x+)aelim<jtf(x  + /e)  0<x<«  (31) 


On  the  other  hand,  the  discontinuity  of  H(z)  on  the 
cut  is  given  by 

ff(x+)  - H(x_)  - -21  sin  ?(*)*«(*  )*(*)  (32) 

Upon  substituting  0 by  its  value  (31)  we  find 
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e2 «?(x)//(x+)  _ //(*_)  = 0 0 < .r  < ' 


(33) 


which  is  a homogeneous  Riemann-Hilbert  type  equation.  !t  is  now  easily  verified  that  a particular  solution  of  this 
equation,  satisfying  Eq.  (30)  is  given  by 


H(z)  = exp  Mz)l 


(34) 


J0 


W(Z)  = - — | -6-^-  Ay 


y-z 


(35) 


Indeed,  from  our  assumptions  on  the  phase  shift,  we  have 


, Um  w{z)  = 0 
lzl-*o° 


(36) 


in  all  directions.  Moreover,  co(z)  is  a well-defined  function  on  the  cut,  except  perhaps  at  z = 0,  where  its  behavior 
is  given  by 


w(z)  * log  z + c5(z) 

IT 


(37) 


cj(z)  being  finite  when  z -*•  0,  and  6 (0)  = 6(0)  = n (one  positive  energy-bound  state).  Therefore,  the  function 
exp[w(z)]  has  only  a simple  zero  at  z = 0.  According  to  Eqs.  (34)  and  (35),  0(x)  is  given  by 


<t>(x)  = e“(X+)  = e<*(*)  <.-/«(*) 

(38) 

/•oo  ~ 

a(x)  = --—/>  I MZJ- 

* Jo  y~x 

(39) 

We  have  to  verify  now  that  this  indeed  satisfies  the  integral  equation  (23).  This  is  most  easily  done  since  by 
Cauchy’s  theorem  we  have 


e--Sm5>)  e*<y)e-tS<jr)4y  = lim 
y-x-iO  e-*0 


)J 


= ««(*+)_  ] = <p{x)~  1 


(40) 


where  the  contour  T is  shown  on  figure  2,  R being  the  circle  with  a very  large  radius.  (With  w(z)  regular  at  z = 0, 
it  is  obvious  that  the  contribution  of  the  small  circle  enclosing  the  origin  vanishes  when  its  radius  r tends  to  zero.) 
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Note  that  #(x)  is  regular  at  x * 0.  In  fact  it  has  a simple  zero  there.  It  is  also  obvious  from  Eqs.  (38)  and  (39) 
that  is  Holder-continuous  with  the  same  index  as  the  phase  shift,  and  remains  bounded  when  x-**°.  Our  a 
priori  hypothesis  on  0 for  solving  the  integral  equation  is  therefore  justified. 

Let  us  consider  now  the  homogeneous  equation 


*o(*) 


1 

ir 


*»(y) 

y —x  — iO 


«o(y)  dy 


(41) 


,0  which  we  associate  the  function 


1 

Ho(y)  * — I 

* Jo  >"Z 


$oO)  dv 


Assuming  a priori  that  0O  is  Holder-continuous  . . . etc,  we  get 


lim  H0{z)  - 0 


Izl 


(42) 


(43) 


in  all  directions.  Proceeding  as  for  H , we  obtain  once  more 


H0(x+)-H0(x-)  * 0 


(44) 


Using  the  Ansatz 


(45) 


where  w(z)  is  again  defined  by  Eq.  (35)  and  A,  are  arbitrary  constants,  we  find 


$o(*) 


x 


(46) 


Again  it  can  be  checked  by  contour  integration  that  Eq.  (46)  is  indeed  a solution  of  Eq.  (41),  and  that  it  has  all  the 
required  properties. 

The  general  solution  of  our  integral  equation  it  therefore 


#*)  « 


(47) 
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Going  back  to  our  earlier  notations,  we  obtain 


(48) 


(49) 


This  solution  depends  on  the  arbitrary  constant  A . However,  from  its  definition,  F must  keep  a constant 
sign  for  all  values  of  k.  Here  F should  be  positive  since  e = 1.  On  the  other  hand,  we  know  that  sin  6 changes 
sign  at  the  energy  of  the  bound  state,  which  occurs  at  E - k02 , k0  being  the  point  where  the  phase  shift  crosses 
the  value  zero  [fig.  1(a)] . It  follows  that  we  must  have 


A = ~kD7 


(50) 


in  order  for  F given  by  Eq.  (48)  to  keep  a constant  (positive)  sign.  We  therefore  see  that  the  solution  is  completely 
determined  by  the  phase  shift  alone,  since  this  also  determines  k0.  Once  F is  known,  we  obtain 


k U(k)  = \/  F(k) 


(51) 


which  gives  in  turn  U(r)  by  inverting  the  Fourier  transform  (1 1).  It  can  then  be  checked  that  V satisfies  the  con- 
ditions (12)  and  (13)  since  F is  Holder-continuous  and  decreases  like  k~^,  tj  >0,  when  k-*M.  From  Eqs.  (48) 
and  (49)  it  is  clear  that  these  properties  hold  for  F if  they  hold  for  the  phase  shift. 

The  case  when  several  positive  energy  bound  states  occy  at  Fa  = kal , a = 1, 2, . . . , v,  and  e = 1 , is  quite 
similar,  and  leads  to 

F(k)  = - -y  sin  6(*)  > j”|  ^1  - ^ (52) 

a 


We  consider  now  the  case  where  e - -1,  and  where  there  are  two  bound  states,  one  with  positive  energy 
E0  - k02  , and  the  other  a true  bound  state  with  negative  energy  E * -y7 . A typical  behavior  of  the  phase  shift  in 
this  case  is  shown  on  Figure  1(b).  Since  6(0)  = 2rr,  we  get  that  u(z)  defined  by  Eq.  (35)  has,  according  to  Eq.  (37) 
a double  zero  at  the  origin.  Proceeding  as  before,  and  taking  into  account  the  eigenvalue  equation  ( 1 8)  with  r - y , 
and  the  fact  that  the  solution  F(k)  must  keep  a constant  (negative)  sign  for  all  A: , we  obtain 

F(k)  = ~ sin  6 (*)  ea(k^  (l  - ^1  + —■  j (53) 

In  this  case,  we  see  that  the  potential  is  determined  unambiguously  from  the  phase  shift  and  the  energy  of  the  true 
bound  state. 


5-89 


The  generalization  of  the  above  method  to  higher  partial  waves  is  now  obvious.  We  only  have  to  use  the 
Hankel  integral  transform 


HKp)  * 


Jfrin&r) 

k »+! 


dr 


(54) 


instead  of  the  sine  transf  orm  (11).  This  permits  us  to  eliminate  the  centrifugal  potential  and  to  obtain  equations 
quite  similar  to  those  obtained  for  the  S wave.  The  conclusions  are  then  quite  similar. 


MORE  GENERAL  INTERACTIONS 


The  method  we  have  been  discussing  can  be  used  with  more  general  nonlocal  interactions.  The  first  generaliza- 
tion consists  in  consideiing  the  interaction 


l/(r,r')  = V(r)  6(r  - r')  + e git  g(r)  u g(r')  /’gfcos  0) 

e 


(55) 


where  the  local  potential  V is  assumed  to  be  known.  The  radial  equation  now  reads 

P 8(8  + 1)*] 

*t  + |£  - h " vh  + e8f/fiW  J r)  dr' 


(56; 


Here,  the  inverse  problem  consists  of  finding  when  the  phase  shift  and  the  local  potential  V are  both  known. 
For  this  purpose,  we  assume  that  the  local  potential  satisfies 


f°°r  I V(r)  1 dr  < °» 

0 

We  can  now  solve  (in  principle)  the  radial  equation  with  V alone: 

„ f e(E+i)l 
+ . v{rHt 


(57) 


rio  (2 * + !)!! 


U +0(1)1 


(58) 


(59) 


and  define  the  integral  transforms 


590 


(60) 


...  •<-*  Stea- 


o 


Ut(r)  = J_“&8(£)0e(£,r)  dp(£) 


(61) 


where  p(£)  is  an  appropriate  measure,  given  in  terms  of  the  Jost  function  of  V.  (These  integral  transforms  cr.n  be 
slrown  to  have  properties  quite  similar  to  those  of  Hankel  transforms  used  earlier.  They  reduce,  of  course,  t j the 
latter  when  /=  0.  For  the  definition  of  the  spectral  measure  p(£),  see  Newton’s  discussion  at  the  beginning  of 
this  chapter.)  If  we  write  the  phase  shift  as  (for  simplicity,  dropping  the  subscript  ?) 


5 a 6 y + 5 y 


(62) 


where  6 y is  the  known  phase  shift  due  to  V alone  - that  is,  the  phase-shift  of  Eq.  (58)  - we  obtain  for  by  an 
equation  quite  similar  to  (9)  and  (10)  where  U is  now  given  by  Eq.  (60).  Therefore,  we  again  have  a Hilbert- 
Mushkelishvili-type  equation  for  the  inverse  problem,  which  can  be  solved  under  the  same  h'  potheses,  as  before,  on 
6 fj.  The  conclusions  are  quite  similar:  if  we  know  6,  and  therefore  bjj,  and  the  energies  of  true  bound  states 
(£  < 0),  and  if  bjj  satisfies  certain  conditions  similar  to  those  discussed  earlier,  we  can  find  U(r)  unambiguously. 
(Here,  too,  one  can  show  that  the  positive  energy  bound  states  are  given  by  those  values  of  k where  6 y(k)  crosses 
downward  mt  when  we  increase  k. ) 

Another  generalization  consists  of  having  several  separable  potentials  in  each  angular  momentum  state: 


l/(r,r)  ■=  K(r)6(r-r')  + £ 

£ 


s£ 

]C  tyfrM  ■ ■ ' 


(63) 


The  right-hand  side  of  the  radial  equation  then  reads 


♦E  egj  ^£f(r)  J Ufa (r>)  'Mr') 

**i 


(64) 


There  are  two  ways  of  solving  this  equation:  either  we  solve  it  step  by  step,  using  the  integral  transformed  defined 
by  the  wave-function  of  the  previous  step,  exactly  as  for  Eq.  (56);  or  we  use  the  integral  transform  defined  by  08 
throughout,  leading  to  a set  of  algebraic  equations,  to  determine  the  relation  between  the  potentials  and  the 
phase  shift.  Once  more,  one  is  led  to  singular  integral  equations  of  the  Hilbert-Mushkelishvili  type,  and  finally  to 
conclusions  quite  similar  to  those  reached  before  the  inverse  problem.  In  this  case,  whatever  and  the  negative 
energy  bound  states  may  be,  we  can  always  find  an  appropriate  sei:  of  that  solves  the  problem.  In  other  words, 
$8  may  have  any  shape  if  we  choose  Sg  large  enough. 

There  has  been  recently  some  revival  of  interest  in  the  use  of  separable  interactions  for  studying  the  properties 
of  the  nuclear  matter.  For  references,  see  Sabatier’s  paper  earlier  in  this  chapter. 
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We  end  up  by  giving  as  an  amusing  and  somewhat  surprising  example  the  construction  of  a class  of  completely 
transparent  nonlocal  potentials.  We  start  from  any  totally  repulsive  (everywhere  positive)  well-behaved  local  poten- 
tial, that  satisfies  Eq.  (57).  We  can  then  write  its  phase  shift  for  any  given  8,  5 in  the  form 


8K<R> 


S*  I 


(6 ... 


where  each 


has  the  appropriate  shape  for  the  inverse  problem  for 


6 K + 


5e 

8^(3  0) 


to  be  solvable  step  by  step  as  before.  We  end  up  in  this  way  with  a total  phase  shift  that  is  zero  at  all  energies.  Doing 
this  for  every  8,  we  obtain  the  nonlocal  potential  (63)  for  which  all  the  phase  shifts  are  zero  for  all  energies  Notice 
that  there  is  no  convergence  problem  for  the  sum  over  s in  Eq.  (63)  since  for  8 large  enough,  the  local  phaie  shift 
8 j/R)  is  very  small  at  all  energies.  This  means  that  there  is  some  8 beyond  which  we  can  take  Sj  ■ 1 . The  local 
potential  and  tht  decomposition  of  6 j/®)  into  the  sum  (65)  being  to  a large  extent  arbitrary,  we  obtain  thus  a large 
class  of  nonlocal  completely  transparent  potentials. 

The  same  method  can  be  used  to  obtain  a class  of  nonlocal  potentials,  all  corresponding  to  a given  set  of  phase 
shifts,  of  which  the  above  example  is  of  course  a particular  case  [8g(J:)  * 0 for  all  8 and  *]. 

These  simple  examples  show  that  in  general  the  inverse  problem  for  nonlocal  potentials  has  not  necessarily  a 
unique  solution  if  one  is  only  given  the  S matrix  on  the  energy  shell. 
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DISCUSSION 


Degasperis:  I have  applied  this  theoiy,  especially  Calogero's  work,  to  another  field  Two  years  ago  I applied  a 

high-energy  expansion  to  a plasma  diagnostic  problem.  In  that  case,  the  high-frequency  behavior  of  the 
electric  potential  outside  a plasma  cylinder  provided  the  input  data  for  reconstructing  the  electron  density 
profile  inside  the  plasma 
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6.  ELECTROMAGNETIC  SCATTERING 


This  chapter  is  concerned  with  the  broad  range  of  practical  inversion  problems  in  the  field  of  electromagnetic 
scattering;  these  problems  are  also  discussed  by  Mittm,  Bojarski,  Sondhi,  and  Prosser  in  Chapter  8.  Certain  of  the 
problems  are  amenable  to  full-wave  treatment,  while  others  require  geometrical  or  phy  lical  optics  assumptions.  The 
authors  invariably  are  concerned  with  several  facets  of  the  problem: 

1 . Characterization  of  the  scattering  data.  One  wishes  to  know  the  necessary  and  sufficient  conditions  to  ensure 
that  scattering  amplitude  functions  really  are  scattering  amplitudes  of  properly  posed  problems;  such  conditions 
usually  are  of  the  form  of  analyticity  requirements  for  exact  problems. 

2.  Practicality  of  solutions.  Algorithms  that  invert  the  scattering  data  to  yield  the  scatterer  function  should  be 
simple  and  “practical”;  they  should  use  a minimum  of  data. 

3.  Relationship  between  approximate  and  exact  techniques. 

4.  Stability  criteria.  Small  errors  in  scattering  amplitudes  should  not  lead  to  large  errors  in  characterization  of  the 
scatterer. 

5.  Realizability  criteria.  These  are  requirements  on  scattering  properties  that  lead  tc  physical,  realizable  scatterers. 

6.  Extension  of  inversion  techniques  to  dissipative  systems. 


H.  E.  Moses  organized  and  chaired  the  session  devoted  to  electromagnetic  scattering. 
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A number  of  exact  and  approximate  methods  for  solving  die  inverse  scattering  problem  for  transmission  lines 
are  reviewed.  In  particular,  the  application  to  transmission  lines  of  Marcenko 's  [19SS]  version  of  the  Gelfand- Levitan 
exact  method  for  the  quantum  mechanical  problem  is  compared  with  a more  direct  approach  based  on  a different 
version  of  the  Geifand-Lev  in  method.  In  addition,  some  aspects  of  the  lack  of  uniqueness  of  solutions  are  discussed, 
and  some  open  questions  > elated  to  the  inverse  scattering  problem  are  suggested. 


INTRODUCTION 


Transmission  line  scattering  is  distinguished  here  by  its  one-dimensional  character  and  the  fact  that  waves 
propagate  along  the  line  in  two  directions,  from  left  to  right  and  from  right  to  left.  It  takes  'lace,  typically,  in  a 
cylindrically  symmetric  medium  that  extends  to  infinitv  in  the  direction  of  propagation.  The  physical  properties  of 
such  a medium  are  stratified  - they  vary  only  in  the  direction  of  propagation  - although  it  may  be  enclosed  in  a 
cylindrical  structure. 

A complete  description  of  electromapietic  wave  propagation  in  a cylindrical  structure  requires  representing  the 
fields  as  infinite  sums  of  modes.  Each  rield  in  a mode  has  a vector  factor  which  depends  only  on  coordinates  trans- 
verse to  the  c'-iindrical  axis  and  a scalar  factor  which  depends  only  on  position  along  the  cylindrical  axis.  In  the  loss- 
less case,  in  each  mode  a pair  of  such  scalar  factors,  corresponding  to  die  electric  and  magnetic  fields,  satisfies  the 
transmission  line  equations 


— = iuL(z)I 
d z 

~ = i wC(z)  V 


0) 


where  V u associated  with  the  electric  field,  / is  associated  with  the  magnetic  field,  <•>  is  the  frequency , L(z)  is 
the  effective  distributed  inductance,  and  C(z)  is  the  effective  distributed  capacitance  \Afarcuvitz,  1964,  pp.  I-SJ . 

Other  than  cylindrical  structures  can  lead  to  a transmission  line  description  of  wave  propagation,  e.g.,  a spherically 
symmetric  ipeoium  [Mcuruvitz,  1964,  pp.  29-54] . The  requirement  is  that  solutions  of  the  bask  governing  equations 
be  separable  into  factors,  one  of  which  it  a function  of  a single  variable  whose  values  denote  positions  along  a propaga- 
tion path.  This  variable  can  represent  curvilinear  displacement  and  might,  for  example,  be  one  of  the  angular 
coordinates  in  a spherical  system. 


The  transmission  line  equations  (!)  c»n  be  generalize  *■  that  the  V and  / are  n component  vectors  and  L 
and  C are  nXn  matrices;  such  s generalisation  is  necessary,  for  example,  when  the  medium  is  anisotropic.  Then 
the  physical  prototype  is  a system  of  coupled  2n  wire  transmission  lines. 
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For  certain  nonseparable  cases  the  transmission  line  formalism  can  provide  an  approximate  model  for  wave 
propagation.  For  example,  it  can  be  used  for  a tapered  waveguide  with  slowly  varying  cross  section  [ Shelkunoff , 
1943,  pp.  405-406) . 

The  physical  meaning  of  the  parameters  in  Eq.  ( 1)  may  be  modified  on  occasion.  For  example,  Becher  and 
Sharpe  [1969]  have  applied  the  transmission  line  formalism  to  the  problem  of  determining  the  conductivity  of  the 
earth  from  its  low  frequency  propagation  effects.  In  this  case,  the  fimetions  C(z)  and  Z.(r^  are  proportional  to 
the  square  root  of  the  conductivity  and  its  reciprocal. 


DIRECT  VS  INDIRECT  SCATTERING 


The  direct  scattering  problem  for  the  transmission  line  is  equivalent  to  the  problem  of  calculating  its  input 
impedance  at  its  terminals  (Kay,  1963) , which  can  be  taken  to  be  ± «•  in  general.  It  is  assumed  that 


lim  £(z)  = £, 

Z o© 


lim  C(r)  = Ci 

7 •+  OO 


(2) 


lim  L(z)  - L o lim  C(z)  = Co 

2 *'♦  c©  2 » 


where  L|.  C|,  L0,  C0  are  all  constants. 

Crder  appropriate  conditions  Eq.  (1)  also  has  the  property  that  a solution  exists  for  which  the  asymptotic 
behavior  ot  V is 


V ^ exp(<*0  z)  + r(<*>)exp(-**o  *)  for  z -*•  -«• 

V *\»  f(w)exp(ik|  z)  for  z -*•  +'» 

where 

*o  - o>{LoCo)'n  *,  =u(LiC,)'/2 


(3) 


Then 


/ “V  K0{exp(i*c  t)  - Kw)exp(-i*0z)  1 for  z 


I 'v*  y,(w)exp(iJt,  z) 


for  z -*•  +oo 


(4) 


where 


Yt 


The  quantities  Jk(-  are  propagation  constants,  y • characteristic  admittances,  2;-  characteristic  impeaances; 
r(u>)  is  the  left  amplitude  reflection  coefficient,  and  f(eo)  is  the  right  amplitude  transmission  coefficient. 

The  asymptotic  relations  (3)  and  (4)  correspond  to  a wave  of  unit  amplitude  incident  from  the  left,  partially 
reflected,  and  partially  transmitted.  If  a wave  of  unit  amplitude  is  incident  from  the  right,  a reflected  wave  with  amp- 
litude equal  to  a reflection  coefficient  p(w)  and  transmitted  wave  with  amplitude  equal  to  a transmission  coefficient 
r(w)  will  be  generated.  The  coefficients  r(o>',  p(u),  f(u),  r(u>)  are  dements  of  a 2 X 2 matrix  called  the  scatter- 
ing matrix.  The  direct  scattering  proHetu  can  also  be  formulated  as  the  calculation  of  the  scattering  matrix  of  the 
transmission  line.  With  this  formulation,  only  the  function  V need  be  considered  in  the  solution  of  Eq.  (1). 

In  the  inverse  scattering  problem  some  part  of  the  scattering  matrix,  usually  the  reflection  coefficient  r(  w),  is 
given  and  it  is  required  to  calculate  L(i)  and  C(z\  or  their  equivalents.  Most  often  r(o)  is  a prescribed  function 
of  frequency,  although  in  some  cases  the  reflection  coefficient  is  given  at  a fixed  frequency  and  ar-  a function  of  some 
other  parameter  such  as  angle  of  incidence. 

The  inverse  scattering  problem  for  quantum  mechanics  is  similar  in  that  some  part  of  the  scattering  matrix 
associated  with  an  unknown  potential  is  given  as  a function  of  die  energy,  and  it  is  required  to  determine  the 
scattering  potential.  However,  in  quantum  mechanics  the  problem  is  essentially  three  dimensional  and  therefore 
leads  most  naturally  to  a somewhat  different  mathematical  formulation,  even  after  separation  of  variables,  than  does 
the  transmission  line  propagation  problem.  This  is  also  true  of  certain  inverse  scattering  problems  for  the  electro- 
magnetic field  - when  a Add  is  scattered  in  a homogeneous  medium  by  an  obstacle  of  unknown  shape  and  >i  is 
required  to  determine  die  dupe  or  other  geometrical  and  physical  properties  of  the  obstacle  from  a knowledge  of  the 
scattered  field  at  large  distances. 

It  is  true  that  for  spherically  symmetric  potentials  the  quantum  mechanics  inverse  scattering  problem  reduces  to 
one  for  the  Schwinger  equation  in  a single  variable,  the  radial  coordinate.  However,  the  problem  is  still  essentially 
three  dimensional  and  differs  from  that  of  die  transmission  line  in  that  it  is  usually  formulated  in  terms  of  the  phase 
shift  of  a standing  wave  rather  than  reflection  coefficient  of  a propagating  wave. 

For  quantum  mechanics  die  ground  state  radial  Schrodmger  equation  has  the  form 

^ + IE-JV)]*  = 0 (5) 


for  the  interval 


0 < r < - 


with  the  boundary  condition 
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*(Q)  = 0 


(6) 


and  the  asymptotic  condition 


4>  'v*  i4(£')  sin[r  + d(£)]  as  r •*  <* 


(7> 


The  inverse  scattering  problem  is  to  determine  P(r)  from  a knowledge  of  0(£).  The  transmission  line  problem  can 
be  transformed  into  one  for  a differentia]  equation  similar  to  Eq.  (S);  however,  the  interval  over » hich  the  independent 
variable  ranges  is  (-«>.  <»)  rather  than  (0,  °°)  and  the  boundary  condition  (6)  is  not  imposed. 

Mathematically,  the  difference  between  the  two  problems  is  primarily  that  the  spectrum  associated  with  the 
quantum  mechanics  problem  is  simple,  whereas  that  associated  with  the  transmission  line  problem  has  a multiplicity 
of  2.  Because  of  this  difference  the  Ge'.fand-Levitan  method  of  solving  the  in/erse  scattering  problem  of  quantum 
mechanics  [Gclfand.  und  Levitan.  1951]  must  be  modified  when  it  is  applied  to  the  transmission  line  problem. 

The  Marcenko  variation  of  the  Gelfand-Levitan  method  [Marcenko,  1955,  Agranovich  and  Marcenko.  1963]  was 
originally  applied  to  the  problem  of  quantum  mechanics  but  has  been  used  by  Sharpe  [1963]  in  solving  the  synthesis 
problem  for  transmission  lines.  This  was  possible  because  in  the  Marcenko  method  the  quantum  mechanics  model 
given  by  Eqs.  (5),  (6)  and  (7)  is  viewed  not  as  a standing  wave  picture,  but  as  one  for  a propagating  wave  incident 
from  the  right  on  a potential  with  a short  circuit  terminal  impedance  condition  at  0 and  a wave  reflected  to  the 
right.  The  reflection  coefficient  in  this  case  is  exp (2r'd).  When  Marcenko's  method  is  applied  to  the  transmission 
line,  however,  an  unnecessary  complication  arises  because  of  the  artificial  boundary  condition  (Eq.  (6».  This  will  be 
discussed  in  more  detail  later. 

The  difference  in  form  between  Eqs.  ( 1)  and  (5)  is  not  as  crucial  as  it  might  seem  at  first.  A transformation 
exists  that  will  change  Eq.  (1)  into  the  form  of  Eq.  (5)  and  thus  reduce  the  inverse  scattering  problem  for  the  trans- 
mission line  to  one  for  the  Schrodinger  equation  [Kay  and  Moses,  1956] . The  transformation  is  given  by 

x=[  4>=Z_lJ  * = KZ-W  (8) 

/ Kz) 


where 


v = (Lcrln 

is  the  wave  velocity  and  Z = L/C  is  the  characteristic  impedance  of  the  medium;  then  & will  satisfy  Eq.  (5)  with 
P(x)  related  to  <1>  K 

d2  <t> 

“ -£(*)*  = 0 (9) 

dx* 

If  the  inverse  scattering  problem  for  Eq.  (5)  is  solved  to  determine  P(x).  <t>  can  then  be  determined.  However,  the 
range  of  x in  this  case  is  ( -*»,  °°)  rather  than  [0,  «•»]  as  in  the  problem  of  quantum  mechanics;  compare  Moses 
and  deRidder,  [1963]. 


6-5 


The  relation  between  Eqs.  (1)  and  (S)  can  be  more  direct  as  when  a plane  wave  at  a fixed  frequency  is  incident 
on  a linearly  stratified  medium  at  various  angles  and  the  reflection  coefficient  is  given  as  a function  of  the  angle  of 
incidence  rather  than  frequency.  If  L is  constant  and  only  C varies,  I can  be  eliminated  from  Eq.(l),  and  the  equa- 
tion for  V becomes 


£v 

dra 


cos*  a 


-mj(2)]k  = 0 


(10) 


where  k0  is  the  wave  number  in  the  homogeneous  region  assumed  to  he  to  the  left  of  the  inhomogeneous  medium, 
a is  the  angle  of  incidence,  and  p1  (z)  is  the  differin  '*' 1 .tween  the  square  of  the  wave  number  in  the  inhomogeneous 
region  and  that  in  the  homogeneous  region. 


QUESTIONS  O'  UNIQUENESS 


In  view  of  Eqs.  (8)  and  (9)  the  inverse  scattering  problem  for  the  transmission  line  clearly  uoes  not  have  a unique 
solution,  since  the  velocity  of  propagation  v(z)  can  be  any  continuous  positive  function.  However,  in  most  propaga- 
tion media  of  practical  importance  the  permeability  is  that  of  vacuum,  and  therefore  only  the  permittivity  varies.  In 
such  a case,  there  will  be  a well-defined  relation  between  L and  C depending  on  the  particular  mode  being  considered, 
and  this  relation  will  resolve  the  ambiguity. 

Under  suitable  restrictions  a knowledge  of  the  reflection  coefficient  r(w)  is  sufficient  to  provide  a unique 
solution  to  the  inverse  scattering  problem.  However,  a knowledge  of  the  transmission  coefficient  r(w)  is  not 
sufficient  to  determine  a unique  solution,  in  general; compare  Karp  and  Shmoys  [19SS]  ,Kay,  [19601 . This  fact  is 
implied  by  a consideration  of  whether  a knowledge  of  r alone  or  t alone  is  sufficient  to  determine  the  remaining 
elements  of  the  scattering  matrix.  For  physically  reasonable  transmission  lines  the  scattering  matrix  will  be  unitary 
for  real  w and  the  principle  of  reciprocity  will  hold.  Its  elements  thus  will  have  the  following  properties  [cf.  Heim 
mi  Sharpe,  1967]: 


( for  <o  real)  |r|*  + |r|*  = |tl*  + lp|2  = 1 (conservation  of  energy) 


t = t (reciprocity) 


(11) 


(for  real)  f(-«)  * f*(w),  r(-u)  = r*(w),  p(-w)  = p*(«) 


r,  p,  t are  analytic  functions  of  u>,  regular  in  the  upper  half  plane 


r(w)  * 1 + o(co)  for  large  w on  the  real  axis  and  in  the  upper  half  plane 


r(w)  has  no  zeroes  in  the  upper  half  plane  or  on  the  real  axis  (passive  line) 


The  properties  (11)  imply  that  f(w)  and  p(w)  can  be  determined  uniquely  from  knowledge  of  r(w), 
but  r(w),  cannot  be  determined  uniquely  from  a knowledge  of  f(w).  In  fact,  for  « in  the  upper  half  plane 
f(w)  is  given  by 
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(12) 


log  t( u)  = 


log[l  - r(a)r(-o))da 
a-  w 


When  r(co)  is  a rational  function  of  to 


M 

n (co  + «,) 

/(«)™ (13) 

n(w-\,) 

t=i 


where  the  X/  are  poles  of  r(co)  in  the  lower  half  plane  and  the  k;  are  the  n roots  of 


1 - 'fw)r(-w)  = 0 


(14) 


in  the  ipper  half  plane  [Xav,  1960] . 

These  results  are  based  on  the  assumption  that  the  differential  equation  does  not  have  a discrete  spectrum, 
which  is  the  case  when  the  scattering  matrix  is  regarded  as  a function  of  frequency.  However,  a discrete  spectrum 
can  occur  along  with  the  continuous  one  in  the  case  of  a plane  wave  incident  on  a linearly  stratified  medium  at  a 
fixed  frequency  when  r is  regarded  as  a function  of  the  angle  of  incidence  so  that  the  differential  equation  for  the 
problem  is  Eq.  (10).  In  this  case  the  inverse  scattering  problem  has  an  infinite  number  of  solutions  for  a given  r(w). 
These  occur  when  the  condition  that  r(oj)  be  regular  in  the  upper  half  plane  is  violated  by  the  assignment  of  poles 
for  r( cj)  on  the  positive  imaginary  axis.  When  these  poles  are  not  also  poles  of  r(w),  additional  solutions  occur. 
For  example,  an  infinite  set  of  media  exists  for  which  the  reflection  coefficient  is  identically  zeio  for  all  angles  of 
incidence  [Kay  and  Moses.  19566], 


EXACT  METHODS  OF  SOLUTION 


Exact  solutions  of  the  inverse  scattering  problem  have  been  based  on  a general  technique  given  by  Gelfand  and 
Levitan  (1951]  for  the  differential  equation  (5).  This  is  a method  for  determining  the  function  P(r)  fromaknowl- 
edge  of  the  spectral  measure  function  m(£)  [Titchmarsh,  1946;  Newton,  1966,  p.  616;  Kay  and  Moses,  1955]  in 
terms  of  which  the  spectral  resolution  associated  with  the  differential  operator  and  the  boundary  conditions  of  the 
scattering  problem  is  defined.  In  applying  the  Gelfand-Levitan  technique  it  is  usually  necessary,  as  a first  step,  to 
conrtruct  the  spectral  measure  function  from  a knowledge  of  elements  of  the  scattering  matrix. 

In  the  case  of  the  quantum  mechanics  problem  for  which  the  given  scattering  data  is  the  asymptotic  phase 
shift,  this  is  accomplished  by  the  use  of  a dispersion  relation  derived  from  analytic  properties  of  the  scattering  matrix 
in  the  complex  plane.  For  the  transmission  line  problem,  where  the  scattering  data  is  the  amplitude  reflection  coeffi- 
cient r( to),  such  a procedure  is  not  necessary  since  in  this  case  the  spectral  measure  function  depends  only  on  r(w) 
itself  [Kay  and  Moses,  1956o]. 
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Marcenko  [1955]  used  a modification  of  the  technique  to  solve  the  quantum  mechanics  inverse  scattering 
problem  directly  in  terms  of  the  phase  shift.  His  treatment  of  the  problem  can  be  interpreted  as  taking  the  transmission 
line  point  of  view,  wherein  a wave  incident  from  +<*°  on  the  right  is  reflected  from  a transmission  line  which  is  short 
circuited  at  0.  The  reflected  wave,  propagating  from  left  to  right,  has  the  behavior 

^ S(£)exp(i\/?r)  as  r-»+» 


where  for  real  positive  £ the  factor  S(£)  has  unit  amplitude  and  a phase  which  is  twice  that  of  the  phase  shift 
associated  with  the  usual  version  of  the  scattering  problem.  The  transmission  line  reflection  coefficient  is  S(£);  the 
fact  that  it  has  unit  amplitude  is  a result  of  the  short  circuit  at  zero  and  conservation  of  energy  [Kay,  1963] . 

A solution  to  the  inverse  scattering  problem  associated  with  Eq.  (1)  for  a short-circuited  line  has  been  given  by 
Brown  [1966]  using  Marcenko’s  method.  Sharpe  [1963]  used  Marcenko’s  method  to  solve  the  closely  related  syn- 
thesis problem  for  Eq.  (1)  without  restricting  the  problem  to  a short-circuited  line.  A direct  version  of  the  solution 
was  given  by  Heim  and  Sharpe  [1967] , who  were  able  to  avoid  the  assumption  of  a short-circuited  line  at  the  cost  of 
having  to  require  a knowledge  of  both  the  reflection  coefficient  r( to)  and  the  transmission  coefficient  r(u>). 

A physically  oriented,  heuristic  derivation  of  a solution  to  the  inverse  scattering  problem  can  be  given  [Kay, 
1960]  tha*  shows  why  the  transmission  line  approach  leads  to  a solution  depending  only  on  a knowledge  of  the 
reflection  coefficient.  The  derivation  depends  on  associating  the  partial  differential  equation 


V„  -Pix)V-  Utt  = 0 


(15) 


with  Eq.  (5)  and 


tw  - CW  • o (16) 

with  the  unperturbed  form  of  Eq.  (5)  - the  form  which  occurs  when  P vanishes.  It  is  then  assumed  that  U is 
related  to  f/0  by  a linear  transformation  which  can  be  expressed  in  the  form  of  an  integral  operation  on  t/0. 

The  assumption  is,  explicitly, 


U(x,t)  « U0{x,t)  + 


K(x,y)U0(y,t)&y 


(17) 


where  K(x,y)  is  the  kernel  of  the  integral  operator. 
If  the  function  R(r)  is  defined  by 


*(*)  = 


(18) 


R(x  + t ) will  represent  the  asymptotic  reflected  wr*?  due  to  an  incident  wave  5(x  - t),  where  6 is  ths  Dirac 
delta  function.  The  function 


6-8 


U0 (x.t)  m 6(x  - 0 + R(x  + t) 


(19) 


will  then  represent  the  asymptotic  form  of  U when  x -»  The  function  R( r)  vanishes  for  r < 0 if  /•(«)  is 
analytic  and  o(l)  in  the  upper  half  plane.  This  will  be  the  case  when  P(x)  is  zero  for  negative  x. 

If  U(x,t)  that  satisfies  Eq.  (17)  is  the  proper  solution  to  Eq.  (15)  for  the  scattering  problem  it  must  vanish  for 
x > t since  it  represents  a simple  disturbance  traveling  from  left  to  right  with  velocity  1.  Thus,  it  can  depend  on  U0 
only  at  values  of  x that  the  disturbance  has  had  time  to  reach.  In  view  of  this  appeal  to  causality  the  integral  in 
Eq.  (17)  should  only  extend  from  -«•  to  x.  Then 


f/(x,f)  = (/0(x.O  + 


K(x,y)U0(y,t)dy 


(20) 


or,  from  Eq.  (19)  and  a fundamental  property  of  the  delta  function, 


U(x.t)  = 6(x  -t)  + R(x  + t)  + K{x.t)  + 


^(x.y^Cy  + ?)dy 


(21) 


Now,  if  t >x.  Eq.  (21)  becomes 


0 = R(x  + y)  + K(x,t)  + 


K(x,y)R{y  + t)dy 


t>x 


(22) 


Since  R vanishes  for  negative  values  of  its  argument  the  integral  in  Eq.  (22)  actually  can  be  written  with  -t  as  its 
lower  limit,  Equation  (22)  is  a Fredholm  integral  equation  for  the  kernel  K(x,t)  in  terms  of  the  Fourier  transform 
of  the  reflection  coefficient  r(«). 

If  Eq.  (17)  is  substituted  into  Eq.  (15)  it  is  found,  as  usual  when  the  Gelfand-Levitan  technique  is  used,  that 
K(x,t)  satisfies  the  differential  equation 


Kxx-Kft-PMK*  0 


(23) 


subject  to  the  boundary  conditi ons 


K(xf-°°)  = 0 


2d  K(x,x) 
dx 


= F(x) 


(24) 


the  second  of  which  provide;,  the  solution  of  the  inverse  scattering  problem  for  Eq.  (5). 

If  a Fourier  transform  is  applied  to  Eq.  (20)  a solution  ^ of  Eq.  (5)  will  be  expressed  in  the  form 


^(x,£)  = + 


K(x,y)\l>o(y,t')iy 


(25) 
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where  \ho(xJE)  satisfies  Eq.  (5)  when  P(x)  is  zero  and  ^(xS)  ^ (•*►£)  **  x -*  -<*.  In  Eq.  (25)  the  independent 

variable  r has  been  replaced  by  x to  indicate  that  the  range  of  the  independent  variable  is  (-<*>,«»)  rather  than  (0,°»). 

In  particular,  Eq.  (2S)  holds  when  E is  zero.  Therefore,  because  of  Eqs.  (8)  and  (9), 


Z-'n*ZZ'n\\+ (26) 


which  guarantees  that  for  any  given  real,  positive  Z0  lim  Zm  Z0- 

X ■*  -o° 

To  prescribe  Z,  it  is  necessary  to  choose  an  appropriate  value  of  r(0),  the  limiting  static  value  of  the  reflec- 
tion coefficient.  Since  this  corresponds  to  the  case  in  which  the  wavelength  is  infinite  it  might  be  expected,  at  least 
for  a finite  inhomogeneous  transmission  line,  that  r(0)  corresponds  to  the  vanishing  of  the  length  of  the  inhomo- 
geneous line.  Thus,  it  would  be  expected  that 


r(0)  = 


Z\  - Z0 
Z,  + Z o 


(27) 


which  is  the  usual  value  of  the  reflection  coefficient  at  the  interface  between  to  homogeneous  transmission  lines 
having  characteristic  impedances  Z»  and  Zj . 

To  see  that  Eq.  (27)  is,  in  fact,  the  correct  condition  consider  that  because  of  the  properties  (11),  r(0)  and 
r(0)  are  both  real  and  that,  because  of  Eq.  (IX  V » constant  over  the  inhomogeneous  line  at  zero  frequency.  Now 
from  the  definition  of  r and  t and  from  Eq.  (8). 


KZJ,,a  ■ 1 ♦ KO)  VZV'  * KO)  (28) 

It  then  follows  from  energy  conservation  (1 1 ) and  from  (28)  that 

Z0[l  + K0))*  -f*(0)Z,  - (1  -^(O^Z,  (29) 


Equation  (29)  has  two  roots.  One  of  these  is  r(0)  * -X  which  is  ruled  out  by  energy  conservation  except  when  the 
line  is  short  circuited.  The  other  root  is  given  by  Eq.  (27). 

A heuristic  argument  «"»a«  to  the  one  just  given  leads  to  the  fundamental  integral  equation  of  Marcenko.  In 
this  case  the  incident  wave  is  from  the  right  and  therefore  has  the  form  6(t  + x),  while  the  reflected  wave  has  the 
form  S(r  - x).  The  argument  immediately  leads  to  the  integral  equation  for  the  corresponding  kernel  H(x,t), 
which  transforms  a solution  of  Eq.  (16)  Into  a solution  of  Eq.  (IS): 


0 » S(t  - x)  ♦ H(x,-t ) 


■f 


H(x.v)S(t  -y)iy 


t<-x 
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If  t is  replaced  by  -t  the  equation  becomes 


0 ■ S(-t  - x)  + H(x,t)  + / H(x,y)S(-t  - y)dy 


t>x 


(30) 


where 


SC- r) 


p(cj)e'UT  du 


In  the  Marcenko  method  the  reflection  coefficient  p is  not  actually  the  one  used.  Instead,  the  boundary 
condition  i^(0)  * 0 is  imposed,  and  the  corresponding  reflection  coefficient  p0  then  has  a modulus  of  1 so  that  its 
Fourier  transform  will  not  exist  as  a proper  function.  This  difficulty  can  be  resolved  by  considering  solutions  \p±(x) 
of  Eq.  (S)  having  the  asymptotic  forms 


^/±(x)  'N'  exp(±iy/Ex)  as  x -*•  00 


These  solutions  can  be  represented  in  accordance  with  Eq.  (30),  as 


*±(*)  = exp(±iV£x)  + j£°°  H (x  ,y  )exp(  ti^Ey  )dy 


(31) 


The  boundary  condition  will  be  satisfled  by  the  solution  + p0 1^+;  hence, 


«^_(0)  + po(F)^+(0)  = 0 


or 


Po(E) 


.-»»<■>) 


-11+  I W(0j>)exp(-fVZ>)d>-] 
1 + J'  H(0y)exp(iy/Fy)dy 


(32) 


It  follows  that  1 + Po(E)  * 1 - ^_(0)/^+(0)  ■ o(l)  for  large  real  y/F.  The  Fourier  transform  of 
1 - *_(0)/*+(°)  respect  to  y/F  thus  exists  and  gives  the  transform  of  p0  except  for  a delta  function. 
Since  the  incident  disturbance  does  not  reach  x * 0 until  t * 0,  it  must  produce  a reflected  wave  independent  of 
the  boundary  condition  at  x * 0 until  t « 0.  It  must  therefore  be  equal  to  S(r)  for  r < 0 or  5(-r)  for 
t > 0.  Thus,  from  Eq.  (30) 
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(33) 


0 » F(t  + x)  + H(x,y ) ♦ 


Wx,y)F(t  + y)dy 


where 


exp(i !'JET)d\'W 


(34) 


which  is  Mircenkc’s  result. 

Sharpe  [1963]  was  able  to  adapt  this  solution  of  the  inverse  scattering  problem  for  Eq.  (5)  to  the  transmission 
line  problem  despite  the  fact  that  it  is  based  on  a reflection  coefficient  corresponding  to  the  boundary  condition 
* 0 at  x m 0 by  observing  that  the  Fourier  transform  of  the  kernel  F in  Eq.  (33)  can  be  expressed  directly  in 
terms  of  elements  of  the  scattering  matrix  for  the  transmission  line  without  the  short  circuit  at  zero.  If  the  line  is 
assumed  to  be  homogeneous  for  negative  x,  the  solution  $ corresponding  to  a wave  of  unit  amplitude  incident 
from  the  left  will  be  given  by  tp  * r(w)^+  where  the  frequency  to  now  replaces  y/E  of  the  quantum  mechanics 
problem.  For  negative  x 


4>  * exp(ik0x)  + r(u))exp(-ikoX) 


Hence, 


<M°) 


1 + 


K<*>) 


Also,  from  the  properties  (1 1)  and  Eq.  (31)  it  follows  that 


*-“»-L£r^d 

f*(«) 


(35) 


(36) 


Relations  (35)  and  (36)  permit  the  construction  of  the  kernel  F when  r (u>)  and  r(w)  are  known.  Thus,  the  use 
of  Marcenko's  version  of  the  Gelfand-LeviUn  technique  for  the  transmission  line  problem,  introducing  the  unnecessary 
boundary  condition  a<  zero,  leads  to  a requirement  for  knowledge  of  two  elements  of  the  scattering  matrix,  while  the 
direct  approach  leads  to  the  solution  in  terms  of  Eq.  (22),  which  requires  c knowledge  of  just  the  reflection  coefficient 
K<*>).  As  indicated  earlier,  r(u>)  can  be  determined  when  K<*>)  is  known  with  the  use  of  Eq.  (12),  and  it  is  given 
explicitly  by  Eq.  (13)  when  r(w)  is  a rational  function.  In  general,  it  would  appear  to  be  a cumbersome  procedure, 
however. 
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APPROXIMATE  SOLUTIONS 


Here  we  review  briefly  some  of  the  approximate  analytical  methods  for  the  inverse  scattering  problem, 
but  numerical  methods  as  such  are  not  considered.  One  of  the  greatest  benefits  to  be  derived  from  an 
approximate  method  is  the  possibility  that  it  will  provide  some  physical  insight  that  might  not  be  readily 
obtained  from  a more  complicated  exact  method.  Approximate  methods  are  particularly  useful  if  they 
shed  some  light  on  what  is  most  important  in  the  experimental  data  and  what  properties  of  the  transmission 
line  can  be  deduced  from  incomplete  cr  partially  erroneous  data. 

For  many  years,  an  approximate  method  has  been  used  to  solve  the  inverse  scattering  problem  associ- 
ated with  vertical  incidence  radio  soundings  of  the  inonosphere  [Rydbeck,  1942).  At  a given  carrier  fre- 
quency a pulse  is  sent  vertically  upward,  and  the  time  delay  of  the  return  is  observed  experimentally  as  a 
function  of  the  frequency.  These  data  are  then  used  to  determine  the  variation  of  electron  density  as  a 
function  of  height  by  a method  which  is  based  essentially  on.  the  W.K.B.  approximation  to  the  solution  of 
Eq  (5).  The  method  has  been  limited  mathematically  to  the  case  of  a single  turning  point;  that  is,  the 
assumption  is  made  that  P(x)  is  a positive,  monotonicaily  increasing  function.  In  the  quantum  mechanical 
interpretation  this  is  equivalent  to  solvintt  Abel’s  problem  in  classical  mechanics:  to  determine  the  shape  of  a 
hill,  given  the  time  it  takes  for  a particle  to  go  up  and  return  as  a function  of  the  particle’s  initial  velocity  at  the 
foot  of  the  hill  [ Courant  and  Hilbert,  1931.  p.  134] . In  fact,  both  problems  lead  to  the  Abel  integral  equation. 

When  P(x)  is  a monotonicaily  increasing  function  the  reflection  coefficient  r(u.)  has  unit  amplitude  due  to 
total  reflection  at  some  height  where  the  plasma  frequency  is  equal  to  the  carrier  frequency  cf  the  incident  pulse; 
hence,  the  phase  of  r(oj)  constitutes  all  of  the  information  given  by  the  experiment.  In  accordance  with  the  W.K.B. 
method,  ‘he  phase  is  approximately 


<X*)  = 2 f [*5  -P(y))l,‘liy  (37) 


where,  for  c the  velocity  of  light  in  vacuum,  fr  = co/c  is  the  wave  number  and  h satisfies  the  equation 


k 3 = P(h) 


(38) 


The  group  time  delay  of  a pulse  is  then  given  by 


/• h 

l<y>  .2 kf 

c dk  cl 

*o 


[ k 1 -fO-Jp’^dy 


(39) 


The  left  side  of  Eq.  (39)  is  determined  from  the  experimental  data,  and  Eq.  (39)  is  the  Abel  integral  equation  for 
which  the  method  of  solution  is  well  known  (cf.  Whittaker  and  Watson  1948,  p.  229] . 

Karp  and  Shmoys  [1955]  considered  the  inverse  scattering  problem  for  Eq.  (5)  given  the  transit  time  of  a 
pulse  propagating  through  the  medium  between  two  fixed  points.  They  also  used  the  W.K.B.  approximation  and 
thus  started  with  a relation  similar  to  Eq.  (39)  except  for  the  limits  of  integration. 
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The  transit  time  is  given  by 


7is(*) 


[*’  ~P[y)]-utiy 


(40) 


As  indicated  earlier,  the  transmission  coefficient,  and  hence  the  transit  time  obtained  from  the  phase  of  the 
transmission  coefficient,  is  not  sufficient  to  determine  P(x ) uniquely.  However,  Karp  and  Shmoys  point  out  that  in 
this  approximation  P can  be  determined  as  a function  of  a variable  (,  which  is  defined  by 


~x,*i 

i 

where  the  sum  is  over  all  values  x(  of  x such  ‘hat 


(41) 


/>  */>(*,) 


(42) 


and  where 


* i 


1,  if  Piy)  <■  P for  x(  <y  < xi+, 
0,  if  P{y)  > P (at  Xj<  y < xi+l 


That  is,  for  every  possible  value  of  P the  Xj  are  roots  of  Eq.  (42)  and  $ is  then  determined  by  Eq.  (41),  thus 
defining  {( P ) and,  by  inversion,  P( {).  Equation  (40)  then  becomes 


f»s(*) 


a~fXl~Xl 


l*1 


(43) 


Karp  and  Shmoys  were  then  able  to  transform  Eq.  (43)  into  an  integral  equation  that  can  be  solved  by  means  of  a 
Mellin  transform. 

An  approximate  solution  to  the  direct  scattering  problem  for  the  transmission  Hite  equations  (1),  similar  to  the 
W.K.B.  approximation  used  in  connection  with  Eq.  (S).  leads  more  simply  to  a solution  of  the  inverse  scattering 
problem  [BoPnntr,  1950] . For  small  (4Z/dx)/Z , Brtmmer  (1951  ] bas  shewn  that  a WJC..B.  type  of  approxima- 
tion is  valid  Kid  h,  in  fact,  the  lowest  term  of  a series  that  converges  to  the  solution.  For 


F(t)  « 


(44) 


the  reflection  coefficient, Us  lowest  noovanishing  order,  can  be  approximated  by  [Key,  1961] 


r ■ 


G(x)exp(-2i'Jkx)djr 


(45) 
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where  G(x)  = F(z)  whe.i  the  change  of  variable  (Eq.  (8))  from  r to  r is  introduced,  and  where 


k = u(L0Co)‘  2 


The  solution  is  obtained  then,  by  taking  a Fourier  transform  of  both  sides  of  Eq.  (45)  and  integrating  the  result  in 
accordance  with  Eq.  (44). 

Another  method  for  solving  the  inverse  scattering  problem  associated  with  Eq.  (5)  in  the  quantum  mechanics 
case  was  given  by  Jost  and  Kohn  (1952] , and  generalized  by  Moses  (1956]  to  cover  all  scattering  problems  for  Eq.  (5). 
including  higher  dimensional  cases  and  even  cases  where  P is  a linear  operator  of  a very  general  type.  The  approximation 
is  equivalent  to  the  Neumann  series  expansion,  of  which  the  Born  approximation  is  the  lowest  order  term,  for  the  so- 
lution of  the  direct  scattering  problem  in  a certain  sense.  If  the  series  solution  is  cut  off  at  a tern  of  order  n and  the 
resulting  approximation  to  P is  used  to  calculate  a Neumann  series  approximation  of  the  direct  scattering  problem  to 
the  term  of  order  n,  the  scattering  matrix  elemerts  will  be  reproduced  as  given. 

A particularly  simple  expression  for  P(x)  in  the  transmission  line  problem  is  obtamed  by  'his  method  when  the 
first  two  terms  of  the  expansion  are  used 

/>(*)%  -4(R'(2jr)  + £J(2<)]  (46) 


where  R is  defined  by  Eq.  (18).  This  result  is  exactly  what  is  obtained  from  the  Gelfand-Levita.i  method  if  the  integral 
equation  (Eq.  (22))  is  solved  by  iteration  to  two  orders. 


SOME  PROBLEM  AREAS 


In  applying  the  various  methods  of  solution  to  the  practical  determination  of  inhomogeneous  transmission  line 
characteristics,  two  questions  have  considerable  importance:  How  sensitive  will  the  result  be  to  errors  in  the  data,  and 
what  physical  features  can  still  be  determined  when  the  data  are  incomplete?  In  general,  there  are  reasons  to  believe 
that  the  inverse  scattering  p.oblem  is  very  sensitive  to  certain  kinds  of  error  in  the  data.  For  example,  the  scattering 
matrix  for  Eq.  (5)  can  be  calculated  approximately  in  the  direct  scattering  problem  by  the  vaiiationa!  method,  the 
validity  of  which  implies  tiiat  small  changes  in  the  data  could  produce  large  changes  in  the  wave  function  \j/.  This 
may  not  be  significant  for  the  inverse  scattering  problem  associated  with  Eq.  (5)  itself,  but  because  of  Eq.  (9)  the 
solution  to  the  inverse  scattering  problem  for  the  transmission  line  equations  (Eq.  ( 1 ))  is  given  in  terms  of  a wave 
function  of  Eq.  (5)  with  £ = 0.  Thus,  it  would  not  b surprising  to  find  that  the  characteristic  impedance  has  an 
unstable  pointwise  dependence  on  the  reflection  coefficient. 

For  inverse  scattering  problems  associated  with  Eq.  (5)  there  is  no  reason  to  expect  that  such  an  instability  exists 
when  P(x)  is  a monotonicaUy  increasing  function,  a case  for  which  the  Abel  integral  equation  approximation  pro- 
vides a reasonable  result.  The  situation  is  not  as  clear,  however,  when  F(x)  oscillates  - represents  a medium  consist- 
ing of  several  layers  as  in  the  case  of  the  real  ionosphere.  Shielding  by  front  layers  might  completely  obscure  layers 
farther  back.  This  can  happen  if,  at  the  lower  frequencies,  the  sounding  pulse  cannot  penetrate  far  enough  to  affect 
the  rear  layers,  and  if  the  higher  frequencies,  for  which  sufficient  penetration  occurs,  are  above  the  critical  ftequencies 
of  the  rear  layers. 

Mathematically,  these  problems  are  connected  with  the  frxt  that  the  kernel  R(y  + 1)  of  the  integral  equation 
(22)  is  the  Fourier  transform  of  the  reflection  coefficient  and  can  be  very  sensitive  to  pointwise  errors  in  the  data.  If 
the  reflection  coefficient  is  approximated  by  a rational  or  meromorphic  function,  the  important  properties  of  the  data 
will  be  in  the  location  of  the  poles  and  their  corresponding  residues.  Although  an  explicit  solution  to  the  inverse 
scattering  problem  for  Eq.  (5)  can  be  given  when  r(u)  is  rational  [Kay,  1960] . the  form  of  the  general  solution 
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is  too  complicated  to  exhibit  in  an  obvious  way  the  manner  in  which  various  layers  of  >*(•*)  might  be  correlated  to 
specific  features  of  the  data.  It  would  seem  reasonable  that  poles  far  from  the  real  axis  will  contain  the  information 
obscured  by  a shielding  effect  if  it  occurs.  It  is  possible  that  this  relationship  can  be  studied  and  clarified  by  con- 
sidering the  reflection  coefficient  p(u)  on  the  far  side.  This  is  theoretically  possible  since  p(w)  can  be  expressed 
in  terms  of  r(o)  with  the  .lid  of  relations  such  as  Eqs.  (11)  and  (12). 

The  question  of  how  r(«)  can  best  be  approximated  by  means  of  a rational  function  from  given  reflexion 
measurements  is  important  in  any  ca^e  since  the  solution  to  the  inverst  scattering  problem  can  then  be  given  explic- 
itly. The  approximation  must  be  such  that  all  known  physical  properties  of  the  transmission  Hue  are  taken  into 
account-for  example,  that  the  line  is  finite  in  length  and  is  terminated  by  an  impedance  equivalent  to  the  character- 
istic impedance  of  a known  homogeneous  line  of  infinite  length.  Heim  and  Sharpe  [1966]  have  discussed  some  such 
requirements  of  the  transmission  line  and  have  pointed  out  that  for  a finite  passive  line  r(o>)  must  be  meromorphk 
with  poles  only  in  the  lower  half  plane,  it  must  be  o(l ) in  the  upper  half  plane  and  on  the  real  axis,  and  it  must  be 
o(exp(2ikoK>]  in  the  lower  half  plane  if  the  inhomogeneous  line  has  length  £. 

Another  problem,  which  may  not  be  important  in  the  solution  of  a practical  inverse  scattering  problem,  is 
nevertheless  disconcerting.  The  exact  methods  used  for  solving  the  inverse  scattering  problem  a«sociat«.-d  with  Eq. 

(1 ) cannot  produce  a line  whose  characteristic  impedance  has  discontinuities  if  th*  mathematically  required  condi- 
tions of  regularity  are  imposed  [Heim  and  Sharpe,  1966] . For  insight  and  for  the  sake  of  completeness  it  would  be 
desirable  to  generalize  the  methods  to  include  discontinuous  solutions. 


ADDENDA 


A number  of  open  problems  exist  in  connection  with  the  inverse  scattering  problem  for  transmission  lines. 

Some  of  these  were  mentioned  earlier.  The  following  axe  added  here  primarily  because  Jiey  are  probably  solvable. 

Two  of  the  approximate  methods  for  the  inverse  scattering  problem  associated  with  Eq.  (5)  have  been  derived 
from  the  W.K.B.  approximation  for  the  direct  problem  but  have  not  been  derived  from  the  exact  solution  to  the 
inverse  scattering  problem  obtained  by  means  of  the  integral  Eq.  (22).  These  are  the  Abel  integral  equation  method 
for  data  in  the  form  of  the  time  delay  of  a reflected  puke,  and  the  method  of  Karp  and  Shmoys  for  data  in  the  form 
of  the  transit  time  of  a pulse  between  paints  on  either  nde  of  the  inhomogeneous  line.  It  would  certainly  be  of 
academic  interest,  at  lent,  to  derive  these  approximations  from  the  exact  method.  In  addition,  the  derivation  might 
lead  to  intermediate  approximations  that  would  improve  on  the  original  approximations  or  provide  additional  insight 
into  the  physical  relationship  between  the  data  and  the  transmission  line  characteristics. 

The  restriction  of  the  Abel  integral  equation  method  to  cases  in  which  P{x)  in  monotonic  increasing  might  be 
relaxed  by  such  a generalization,  thus  providing  a systematic,  practical  technique  for  interpreting  existing  ionospheric 
data.  If  the  method  of  Karp  and  Shmoys  were  correlated  to  an  exact  treatment  of  the  problem  the  precise  nature  of 
the  information  obtained  in  the  transmission  coefficient  alone  could  be  elucidated.  Tuis  is  related  to  the  very  practical 
question  of  what  information  can  be  obtained  from  partial  scattering  data. 

Another  open  question  is  the  solution  of  the  inverse  scattering  problem  for  the  2n  wire  coupled  system  of 
transmission  lines.  Jost  and  Newton  [1955]  and  Agranovich  and  Marcenko  [1958]  have  given  the  solution  for  a 
system  of  coupled  Schrodinger  equations  [cf.  Butler,  19C9,  who  considered  the  inverse  scattering  problem  for  differ- 
ential equations  of  higher  order  than  two] . A difficulty  in  applying  Marcenko’s  method  to  the  2n  wire  coupled 
transmissioa  line,  as  was  done  for  a single  2 wire  line,  is  the  fact  that,  in  general,  the  wave  velocity  v(z)  will  be 
different  for  each  channel  and  cannot,  therefore,  be  dealt  with  by  a simple  change  of  variables  such  as  that  of  Eq.  (8). 
One  possibility  b to  generalize  or  adapt  the  approximate  method  of  Moses  to  the  multiple  transmission  line  problem. 
Some  due  for  handling  the  difficulty  of  different  wave  velocities  might  thereby  be  revealed  so  that  the  exact  method 
of  Marcenko  might  finally  be  applied. 
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DISCUSSION 


Newton:  The  problem  you  mentioned  at  the  end  of  coupled  transmission  lines  is  analogous  to  the  quantum 

mechanical  problem  of  coupled  channels.  It  is  analogous  to  coupled  channels,  inelastic  scattering.  Prof.  Cox  is 
the  expert  on  that.  You  said  there  is  a relation  between  the  transmission  coefficient  and  reflection  coefficient. 

If  you  do  the  Marcenko  procedure,  you  don’t  have  to  put  that  relation  in  - the  two  come  in  independently. 

Kay:  His  expression  for  the  kernel  that  goes  into  the  Gelfand-Levitan  equation  is  given  directly  in  terms  of  the 

phase  shift. 

Newton:  If  you  don't  put  in  the  relation,  but  put  them  in  independently,  what  happens? 

Kay:  That’s  hard  to  answer.  You  will  be  violating  one  of  the  conditions  on  the  scattering  matrix  elements  them- 

selves. Whatever  potential  you  get  will  violate  that  condition.  It  would  have  some  kind  of  pathological 
behavior. 

Chadan:  In  the  case  of  a simple,  infinite  line,  what  happens  if  you  impose  the  boundary  condition  at  the  origin? 

You  have  two  solutions,  one  of  which  vanishes  at  the  origin.  In  matrix  form  you  have  the  right  to  left  and 
left  to  right  signals. 

Kay:  This  is  what  Marcenko’s  method  does.  You  arrive  at  the  boundary  condition  by  taking  a linear  combination 

of  the  two  fundamental  waves.  Then  you  get  a relationship  that  involves  both  the  transmission  and  reflection 
coefficients,  which  creates  a problem  because  they  are  not  independent. 

Sabatier:  You  have  spoken  only  of  passive  lines.  Are  there  analogous  results  with  nonpassive  lines  with  sources? 

Kay:  I don't  know  of  anyone  who  did  that.  When  you  have  sources,  the  conditions  imposed  on  the  analytic 

behavior  of  the  transmission  coefficient  are  no  longer  applicable.  This  is  analogous  to  what  happens  to  the 
reliability  conditions  in  circuit  theory.  For  passive  circuits,  all  your  poles  are  confined  to  one  half  plane 
and  they  don’t  lie  on  the  imaginary  axis.  For  active  circuits  they  can.  You  have  to  go  back  and  look  at  the 
consequences,  including  the  question  of  existence  and  uniqueness  of  the  solution  of  the  Gelfand-Levitan  equation. 
In  proving  this,  a very  important  contribution  is  the  conservation  of  energy. 

Mittra:  You  started  with  inductance  and  capacitance  in  the  transmission  lines  and  ended  up  with  one  function. 

That  ambiguity  can  never  be  resolved. 

Kay:  You  can  assume  anything  yoc  want  for  the  other  function  as  long  as  it  is  monotonic. 

Mittra:  If  the  transmission  line  nad  losses  and  you  know  the  capacitance,  resistance,  and  conductance,  can  the 

inductance  be  found? 


Kay:  No.  They  are  independent. 
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This  paper  provides  a brief  survey  of  recent  theoretical  progress  on  the  inverse  scattering  problem  for  radiation 
scattering  from  reflective  boundaries  and  variable  indices  of  refraction. 


INTRODUCTION 


The  theory  of  radiation  scattering  is  concerned  primarily  with  the  far-field  relations  between  incident  and 
scattered  radiation  in  the  presence  of  a scattering  object.  The  primary,  or  direct,  problem  of  this  theory  is  to  develop 
quantitative  information  about  these  scattering  relations  from  a knowledge  of  the  scattering  object.  The  secondary, 
or  inverse,  problem,  on  the  other  hand,  is  to  determine  the  nature  of  the  object  from  an  analysis  of  the  scattering 
relations. 

While  the  direct  problem  has  by  now  generated  a well-developed  discipline,  the  inverse  problem  has  only  recently 
aroused  any  serious  attention.  This  is  especially  surprising  in  view  of  the  manifold  practical  applications  to  the  design 
and  operation  of  precision  radar  systems.  This  paper  provides  a brief  survey  of  the  present  state  of  knowledge  and  possi- 
ble future  development  of  the  inverse  probiem  for  radiation  scattering. 


SCATTERING  FROM  A POTENTIAL 


The  first  real  contribution  to  the  inverse  problem  was  made,  not  by  radar  engineers,  but  by  nuclear  physicists, 
around  19S0.  The  physicists  were  interested  in  the  analogous  inverse  problem  for  the  scattering  of  electrons  from  a 
nuclear  potential:  to  determine  the  dupe  of  the  potential  from  the  resulting  scattering  data. 

Levinson  [1949]  tint  showed  that  every  spherically  symmetric  potential  satisfying  suitable  regularity  conditions, 
is  uniquely  determined  by  a knowledge  of  the  phase  shifts  in  the  partial  wave  analysis  of  the  scattering  data.  His  proof 
however,  gives  no  algorithm  for  constructing  the  potential  from  a given  set  of  phase  shifts.  Jost  and  Kohn  [1952]  then 
provided  a recursive  algorithm  for  tlb*  purpose,  which  converges  if  the  potential  is  sufficiently  weak  and  there  are  no 
bound  states.  The  recursion  relations  are  clumsy  to  use  in  practice,  however,  am!  the  conditions  required  on  the  poten- 
tial are  too  restrictive  for  many  applications.  Meanwhile,  Gelfand  and  Levitan  [1951]  showed  that  under  suitable 
regularity  conditions  the  potential  may  be  recovered  from  the  solution  of  a certain  integral  equation  whose  kernel 
depends  on  the  spectral  weigh**.  These  weights  in  turn  may  be  obtained  from  a knowledge  of  the  phase  shifts  and 
binding  energies  of  the  problem.  Since  then  a number  of  variants  and  refinements  of  the  Gelland-Levitan  procedure 
have  appeared;  they  are  summarized  in  the  excellent  survey  of  Faddeev  [1963] . 

In  this  analysis  it  suffices  to  know  the  phase  shifts  and  binding  energies  for  any  tingle  value  of  the  angular 
momentum.  Wheeler  [19S5]  and  Newton  [1967]  have  shown  that  a quite  similar  analysis  reconstructs  the  potential 
from  the  phase  shifts  for  all  values  of  the  angular  momentum  at  any  tingle  value  of  the  energy.  In  a sense,  these  results 
complement  each  other  and  exhaust  the  possibilities  for  radial  potentials. 
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Foi  potentials  that  are  not  spherically  symmetric,  the  situation  is  more  complicated  in  practice  but  essentially 
the  same  in  principle.  In  a series  of  papers  devoted  to  this  problem,  Kay  Moses  (1955-1956)  have  shown  that 
the  Gelfand-Levitan  procedure  can  be  extended  to  nonspherically  symmetric  problems  in  such  a way  that  the  poten- 
tial can  be  recovered  from  the  solution  of  an  integral  equation  whose  kernel  depends  ultimately  on  the  matrix  elements 
of  the  scattering  operator.  For  this  procedure  it  suffices  to  know,  at  least  in  principle,  the  scattering  data  (cross  sections 
and  phase  shifts)  for  a single  aspect  angle,  all  scattering  angles  over  a fixed  hemisphere,  and  all  energies.  The  a priori 
conditions  required  of  the  potential,  however,  are  not  investigated  in  their  papers. 

Moses  [1956]  also  has  shown,  by  means  of  a similar  extension  of  the  Jost-Kohn  procedure,  that  the  general 
potential  can  be  recovered,  at  least  in  principle,  from  a knowledge  of  the  backscattering  data  lor  all  aspect  angles 
over  a fixed  hemisphere  and  all  energies.  Again  the  a priori  conditions  required  are  not  clear,  although  the  procedure 
appears  to  be  applicable  for  sufficiently  weak  potentials. 

None  of  these  results  applies  directly  to  the  inverse  problem  for  radiation  scattering,  which  involves  scattering 
from  a variable  index  of  refraction,  from  a reflective  boundary,  or  from  both,  rather  than  from  a nuclear  potential. 

In  the  first  case  we  seek  to  determine  the  structure  of  the  index  of  refraction  rather  than  the  structure  of  the  poten- 
tial, so  that  we  are  dealing  with  an  inhomogeneous  problem  whose  inhomogeneous  term  is  frequency  dependent  In 
the  second  case  we  seek  to  determine  the  shape  of  the  boundary,  so  fiat  we  are  dealing  with  a homogeneous  b uindary 
value  problem  rather  than  an  inhomogeneous  problem  without  boundary.  Moreover,  radiation  scattering  involves  the 
vector  wave  equation  rather  than  the  scalar  wave  equation,  although  this  difference  does  not  appear  to  be  decisive. 


SCATTERING  FROM  A VARIABLE  INDEX  OF  REFRACTION 

Superficially,  the  problem  of  radiation  scattering  from  a region  of  variable  index  of  refraction  strongly  resem- 
bles the  problem  of  electron  scattering  from  a nuclear  potential,  and  similar  methods  might  be  expected  to  apply. 
Specifically,  the  reduced  wave  equation  for  both  problems  has  the  form 

V2  0 + <uJ0  = F(jr,co)0 

where  is  the  case  of  potential  scattering  the  inhomogeneous  term 

F(jc.co)  = F(x) 

is  independent  of  the  frequency  u>,  while  m the  case  of  refraction  scattering  the  inhomogeneous  term 

F(x,w)  = to1  [1  - n2  (x)] 

is  frequency  dependent.  For  fixed  frequency,  then,  the  solutions  to  the  two  direct  problems  have  the  same  form,  but 
their  dependence  on  frequency  is  different;  and  this  difference  forces  a considerable  difference  in  the  solutions  of  the 
associated  inverse  problems.  In  particular,  the  Gelfand-Levitan  procedure  depends  essentially  on  the  frequency  inde- 
pendence of  the  inhomogeneous  term. 

In  the  spherically  symmetric  case  the  index  of  reh.  non  depends  upon  only  the  radial  variable  and  the  problem 
can  be  presented  in  one-dimensional  terms.  Kay  and  Moses  [195S]  and  Moses  and  DeRidder  ( 1 963]  have  shown  that 
in  this  case  the  refraction  problem  can  be  reduced  to  the  potential  problem  by  means  of  a form  of  Liouville  transfor- 
mation, and  that  the  corresponding  inverse  problem  can  then  be  resolved  by  means  of  the  Gelfand-Levitan  procedure. 
Specifically;  in  this  case  the  refraction  problem  reduces,  via  a partial  wave  analysis,  to  the  one-dimensional  problem: 

+ «2«2(r)j  <K0  = 0 

0(0)  = 0 0'(O)  = 1 
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If  we  introduce  a new  independent  variable 


p(r> 


n(r‘W 


and  a corresponding  new  dependent  variable 


Mp)  = nl'3(r)(p(r) 


then  we  find  that  if/(p)  satisfies  the  equation 


iHp)  * y(p)Mp) 


where  the  “potential"  V(p)  has  the  form 


K(P)  = n~,,^(',+,'i) 

K(p)  and  K'(p)  -*■  0 as  p -*•  «* 

Now  V,  and  hence  n,  may  be  deduced  from  a knowledge  of  the  S-wave  phase  shifts  via  the  Gelfand-Levitan  algo- 
rithm. Note  that  the  new  independent  variable  here  is  essentially  the  optical  path  length,  and  that,  in  the  new  vari- 
ables, the  optical  path  metric  is  constant. 

In  principle,  this  procedure  can  be  extended  to  nonspherically  symmetric  problems,  but  in  practice  new  diffi- 
culties intrude.  If  we  start  from  the  reduced  wave  equation 


[T2  + w2n2(x);  0(x)  = 0 


and  introduce  the  Ansatz 


0(x)  = f*fcS(x)^(x) 


then  we  find 

^7*  + - |7S|2)  + iu>(V2S  + 2^~‘  7 VS)  + *-1  72 
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Hence,  if  we  choose  S so  that 


|7S|2  - n2  = 0 


kS(x)  -*  k-x  as  ix|  -*• 00 


and  then  choose  ^o(x)  so  that 


27  i^0  ■ 7 S + 0o  72S  = 0 


then  we  find  that,  with  O0  = 4*0^$. 


( 7J  + cj2h2  - 4/0‘  71  4/0)4>o  = 0 


Hence,  the  solution  of  the  original  equation  must  also  satisfy  an  equation  of  the  form 


(72  + o>2n3  - 0o  ‘72^o)0  = ,7Ji|'o)0 


The  right-hand  side  here  is  independent  of  w and  serves  as  a potential,  for  which  a suitable  extension  of  the 
Gelfand-Levitan  procedure  would  apply. 

Note  that  here  S satisfies  the  eikonal  equation,  and  ipo  is  related  to  the  gaussian  curvature  of  the  level  surfaces 
of  S.  In  effect,  we  have  introduced  as  new  coordinates  essentially  the  rays  and  wave  fronts  of  the  geometric  optics 
solution,  which  *s  frequency  independent.  This  transformation  is  nonsingular,  however,  only  if  there  are  no  caustics 
in  the  geometric  optics  solution.  The  presence  of  caustics  presents  additional  difficulties  which  have  not  yet  been 
resolved. 

a 

The  Gelfand-Levitan  procedure  is  not  even  formally  applicable  to  problems  involving  frequency -dependent 
perturbations.  The  Jost-Kohn  procedure,  on  the  other  hand,  is  not  sensitive  to  such  a frequency  dependence  and 
normally  extends  directly  to  the  general  refraction  problem.  Briefly,  this  procedure  begins  by  expanding  the  reactance 
matrix  T in  the  Born  series  [ Prosser , 1969] : 


yk2  1 

k2w(k'-k”)  — - w(k"-k)dk”  + ••• 

k ~k 

v 

where  k and  k'  are  the  incoming  and  outgoing  momenta,  k2  = k'2  = u2 , irtd  w(k)  is  the  Fourier  transform 
of  1 -«2(x).  The  reflection,  or  backscatter,  coefficient  £>(k)  is  simply  th^Vilue  of  T(k',  k)  at  k'  = -k.  If  we 

xfs  ->-■ 

now  replace  b(k)  by  eb(  k),  and  w(k)  by  u^k)  = / ^w^fk),  substitutes  in  the  expansion  above  tor  T(-k,k), 
and  equate  the  coefficients  of  en,  we  can  solve  for  the  ww(k)  in  terms  of  the  reflection  coefficient  b( k): 
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k]wi(k)  = fc(k) 

kJw,(k)=  /*„,»' -h")  H-1(k"-k)dk 

k5w(k)  «y^kawn(k) 


This  procedure  determines  the  index  of  refraction,  at  least  in  principle,  from  a knowledge  of  the  backscatter  coeffi- 
cient for  all  aspect  angles  and  all  frequencies.  A similar  procedure  determines  the  index  of  refraction  from  the  scat 
tering  angles  and  all  frequencies,  or  for  fixed  scattering  angle,  all  aspect  angles  and  all  frequencies,  or  for  certain  other 
combinations  of  data.  These  solutions  are  all  formal,  and  give  no  information  on  the  question  of  convergence  or  the 
a priori  restrictions  required  of  the  index  of  refraction.  Mom  [1956]  has  shown,  however,  that  in  the  spherically 
symmetric  case  the  fust  two  approximations  to  the  index  of  refraction  obtained  by  this  method  agree  with  the  fust 
two  approximations  obtained  by  the  Gelfand-Levitan  procedure,  so  that  there  is  some  reason  to  be  optimistic  about 
the  prospects. 


SCATTERING  FROM  A REFLECTIVE  BOUNDARY 


For  the  problems  of  radiative  scattering  from  reflective  boundaries,  the  preceding  analysis  does  not  directly 
apply.  Certain  approximate  results  are  available,  however,  which  suggest  various  avenues  for  future  study. 

If  the  boundary  is  known  to  be  smooth  and  convex,  with  a gaussian  curvature  everywhere  positive,  and  if  the 
frequencies  involved  are  sufficiently  high,  then  the  geometric  optics  approximation  to  the  scattering  matrix  may  be 
valid.  According  to  this  approximation,  the  scattering  cross  section  measured  through  a scattering  angle  0 is  given 
by 


<7(0)  * 


where  R(0/2)  is  the  reflectivity  coefficient  at  the  reflection  point  on  the  surface  (whose  normal  lies  at  an  angle 
0/2  with  the  incident  ray)  and  (7(0/2)  is  the  gaussian  curvature  of  the  surface  of  that  point  [Keller,  1959] . Thus, 
if  the  reflectivity  is  constant  over  the  surface,  the  scattering  cross  section  is  inversely  proportional  to  the  gaussian 
curvature  at  the  reflection  point.  It  follows  that  a knowledge  of  the  scattering  cross  section  for  a single  (high)  fre- 
quency at  two  (diametrically  opposite)  aspect  angles  and  all  scattering  angles,  or  alternatively,  a knowledge  of  the 
backscatteung  cross  section  for  a single  frequency  at  all  aspect  angles,  will  determine  the  gaussian  curvature  of  each 
point  of  the  surface  in  terms  of  the  direction  cosines  of  the  normal  at  the  point.  Note  that  the  convexity  of  the 
surface  assures  that  there  is  just  one  such  point  for  each  normal  direction. 

Thus,  we  are  led  to  the  problem  of  determining  a smooth  convex  surface  from  a knowledge  of  its  gaussian 
curvature.  This  problem  is  a classic  in  the  geometry  of  surfaces,  and  goes  by  the  name  of  Minkowski’s  problem.  It 
is  known  that  Minkowski’s  problem  admits  a unique  solution  provided  that  the  gaussian  curvature  G(n),  as  a func- 
tion of  the  normal  direction  n , satisfies  the  obvious  necessary  condition 
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If  this  condition  is  satisfied,  the  surface  may  be  obtained  as  the  solution  of  a curtain  nonlinear  elliptic  differential 
equation  of  Monge-Ampere  type.  [Nirenberg,  1953] . If  the  surface  is  described  in  rectangular  coordinates  m the 
form  z = 6 (x  ,y ) then  the  determining  equation  lias  the  form 


rt  - s2 

( 1 + p2  + q2  )2 


G{p,q.  1) 


where  p = 30/dx,  q = dd/dy , r = d2Q/dx2,  s = A20/dxdy , t = d2Q/dy2 , and  G(p,q,l)  denotes 
the  gaussian  curvature  of  the  point  whose  normal  has  direction  numbers  (p,  q,  1 ). 

The  proof  of  the  existence  and  uniqueness  of  the  solution,  however,  is  nonconstructive,  and  no  effective  algo- 
rithm for  constructing  the  solution  has  appeared. 

Special  cases  of  this  result  are  sometimes  useful.  If,  for  example,  the  surface  is  known  a priori  to  be  axially 
symmetric,  it  may  be  described  in  the  form  x = 0(p)  where  pJ  = y2  t z2 . Then  the  determining  equation 
becomes 


p(l  + <t>'2)2 


GW) 


where  now  G(0')  denotes  the  gaussian  curvature  given  as  a function  of  the  normal  direction  (1,0'  cosa.0’  sin  a ). 
the  axial  symmetry  implies  that  the  gaussian  curvature  is  the  same  for  all  values  of  a = arctan  z jy  and  so  depends 
only  on  the  slope  0’.  Hence,  it  suffices  to  know  the  curvature  only  at  points  with  different  values  of  0’  -that  is, 
only  along  any  profile.  In  this  case  the  determining  equation  is  integrable  for  0',  and  hence  for  0,  by  quadrature. 

We  conclude  that  for  axially  symmetric  smooth  convex  bodies,  it  suffices  to  know  the  monostatic,  monochro- 
matic scattering  cross  section  for  each  aspect  in  any  plane  containing  the  axis  of  symmetry. 

If  polychromatic  scattering  data  are  available,  then  we  might  expect  that  fewer  aspect  angles  would  be  required. 
To  see  how  this  can  come  about,  we  consider  again  a smooth  convex  body,  but  replace  the  high-frequency  incident 
wave  with  a short  pulse  and  use  the  physical  optics  approximation  in  place  of  the  geometric  optics  approximation. 
Then  in  this  approximation  the  backscatter  return  from  a short  pulse  incident  on  the  surface  is  essentially  propor- 
tional to  the  second  derivative  of  the  cross-sectional  area  of  the  surface  sectioned  at  a distance  ct  from  the  source 
and  normal  to  the  incident  direction.  Thuswehave  o(2r)  = const  A"  (cr)  where  A(x)  is  the  cross  sectional  area 
of  the  surface  in  a vertical  plane  normal  to  the  x axis  and  at  distance  x from  the  origin,  and  A " (x)  = A2  A I Ax2 . 
Here  the  source  lies  at  the  origin,  with  the  incident  direction  along  the  x axis  [Lewis,  1969] . 

From  these  relations  we  can  recover  the  cross-sectional  area  A{x)  as  a function  of  x.  If  we  now  do  the  same 
for  all  other  horizontal  aspects,  we  obtain  a two-parameter  family  of  cross-sectional  areas  of  the  surface. 

The  problem  of  recovering  a surface  from  a knowledge  of  its  cross-sectional  areas  is  another  classic  in  surface 
geometry,  and  is  known  as  Radon’s  problem.  This  problem,  too,  has  - unique  solution  for  smooth  convex  bodies 
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provided  only  that  certain  necenaxy  conditions  are  satisfled.  The  solution  may  be  expressed  in  terms  of  quadra- 
tures over  derivatives  of  the  cross-eectional  area  data.  If,  for  example,  the  surface  is  described  in  rectangular  coor- 
dinates as  za4(xy)  and  we  are  given  the  areas 


^(r.d)=  / d(*.>)dL 

where  I(r,d)  is  the  line  in  the  xy  plane  given  by  x sin  6 + y cos  0 = r,  then  the  function  <t>(x,y)  may  be  recovered 
through  the  formula 

/2ir  max  ... 

where  p denotes  the  projection  of  the  vector  (r.fl)  on  the  vector  (x,y). 

Thus  it  suffices,  at  least  in  principle,  to  know  the  backscattering  amplitude  as  a function  of  time  resulting  from 
a short  pulse  incident  on  the  surface  from  each  aspect  in  any  plane  passing  through  the  surface. 

If  it  is  known  a priori  that  the  surface  is  axially  symmetric,  then  it  suffices  to  probe  the  surface  with  a short 
pulse  in  two  diametrically  opposite  aspects.  If  these  aspects  are  taken  as  “nose  on”  and  “tail-on”  along  the  axis  of 
symmetry,  the  formulas  above  reduce  to  the  following. 

x * dip)  p - ly1  + z1)1'1  ■ d(*) 

o{2r)  ■ const  i4"(cr) 

A(x)  * up1  * ird(Jt)2 


It  follows  that 


*(*)-  [ir-1  >4 (*)]*'* 


Similar  formulas  for  other  aspects  may  alto  be  derived  along  the  same  lines. 

The  weakness  in  all  of  these  arguments,  of  course,  lies  in  the  optics  approximations,  which  are  notoriously  bad, 
particularly  near  the  shadow  line.  Nevertheless  these  arguments  make  plausible  the  hypothesis  that  monostatic  scatter- 
ing data  depending  essentially  on  two  parameters,  such  as  two  independent  aspect  angles  or  one  aspect  angle  and  the 
frequency,  should  suffice  to  determine  the  shape  of  a reflective  boundary. 

It  would  be  highly  desirable  to  have  a procedure  for  reconstructing  the  shape  of  the  boundary  from  the  scattering 
data  that,  in  principle  at  least,  is  exact.  The  analysis  of  Lax  and  Phillips  [1963]  has  shown  that  every  smooth  closed 
reflective  surface  is  uniquely  determined  by  the  scattering  matrix  - by  the  totality  of  the  scattering  data  - so  that  the 
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search  for  a constructive  algorithm  is  at  least  not  doomed  from  the  outset.  Prosser  [1969]  has  shown  that,  at  least 
formally,  the  Jost-Kohn  procedure  can  be  extended  to  this  problem  as  well,  and  that  it  reconstructs  the  Fourier 
transform  of  the  characteristic  function  of  the  region  bounded  by  the  surface  as  a formal  series  of  approximations 
involving  the  backscatter  coefficient,  just  as  for  the  general  refractive  problem.  This  procedure  requires  the  back- 
scatter  data  of  all  aspects  and  all  energies,  so  that,  in  light  of  the  preceding  arguments,  the  solution  is  overdetermined. 
Moreover,  in  simple  examples  the  first  few  terms  of  the  approximation  don't  seem  to  give  anything  sensible  at  all. 

At  the  moment,  however,  this  procedure  is  all  we  have. 

One  other  approach  deserves  mention  here.  If  we  consider  the  boundary  scattering  problem  in  two  dimensions, 
it  can  be  reduced  to  a refraction  scattering  problem  by  means  of  a suitable  conformal  mapping.  Specifically,  suppose 
we  are  dealing  with  the  problem 


7 20  + c o2<b  = 0 outside  S 
<t>  prescribed  on  S 


defined  in  the  xy  plane.  If  we  put  z * x + i>,  and  intioduce  the  conformal  mapping  z = F\w)  of  the  exterior  of 
the  unit  circle  C in  the  w = u + iv  plane  onto  the  exterior  of  the  given  boundary  S in  the  z = x + iy  plane,  normal- 
ized so  that  F(°°)  = °°,  and  = 1 , then  the  equation 

^20  + w20  = O outside  S 

0 prescribed  on  S 


is  mapped  into  the  equation 


+ w2n2(w)0  = 0 outside  C 


0 prescribed  on  C 


where  the  “index  of  refraction”  is  just  n(w)  = |F’(w>)| . Because  of  the  normalization  of  F(w),n(*)-+  1 as 
Ih-I  -*■  00  and  the  scattering  data  are  all  unchanged.  Thus,  any  procedure  that  resolves  the  inverse  refraction  problem 
will  reconstruct  n(w)  = IF1  (w)l  from  the  scattering  data,  and  since  the  mapping  is  conformal,  F(w)  and  hence 
S can  be  reconstructed  from  lF'(w)l. 

In  two  dimensions,  then,  the  boundary  problem  becomes  a special  case  of  the  refraction  problem,  and  will  yield 
to  the  same  methods.  In  three  dimensions,  however,  the  method  of  conformal  mapping  is  not  available,  and  no  similar 
reduction  is  yet  known. 


PRACTICAL  CONSIDERATIONS 


Even  if  exact  theoretical  solutions  were  available  for  the  general  inverse  problems  of  refractive  and  boundary 
radiative  scattering,  the  task  of  translating  these  solutions  into  practical  procedures  for  the  design  and  operation  of 
precision  radar  systems  would  still  present  many  problems. 
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Let  us  suppose,  for  example,  that  such  a system  is  tc  be  used  to  identify  the  structure  of  an  unknown  object  in 
orbit  above  the  earth.  Then  to  begin  with,  the  index  of  refraction  along  the  radar  sight  path  is  generally  not  constant, 
and  will  depend  on  meteorlogical  and  other  natural  data  as  well  as  the  frequency  and  other  operating  parameters.  In 
addition,  the  general  shape  and  reflectivity  of  the  unknown  object  may  not  be  available.  In  particular,  the  surface 
certainly  need  not  be  simple,  smooth  or  convex,  and  the  reflectivity  need  not  be  constant  over  the  surface.  Moreover, 
the  aspect  angles  are  not  under  the  control  of  the  system  operator.  It  is  true  that  most  objects  in  orbit  contain  some 
mechanical  lossy  mechanism,  so  that  their  spin  axis  eventually  converts  to  the  axis  of  largest  moment  of  inertia  and 
they  tumble  end-over-end;  it  is  also  true  that  such  an  object  eventually  presents  every  aspect  to  a fixed  radar  m ,talla- 
tion.  Nevertheless,  the  task  of  extracting  the  scattering  data  An  each  aspect  of  an  object  whose  motion  is  unknown 
from  a time  record  of  observations  made  over  many  passes,  is  by  no  means  trivial.  Finally,  if  the  object  makes  only 
one  pass,  most  aspects  are  never  seen  at  all. 

Practically  speaking,  the  scattering  data  obtained  in  any  such  application  are  apt  to  be  inexact  and  incomplete, 
but  the  theoretical  procedures  described  above  for  recovering  the  object  from  the  scattering  data  all  assume  a complete 
and  exact  knowledge  of  this  data.  No  studies  have  been  made  of  the  stability  question:  whether  snail  errors  in  the 
data  cause  only  small  errors  in  the  result.  And  no  progress  has  been  reported  on  the  approximation  question:  what 
partial  data  can  be  made  to  yield  partial  results.  It  might  be  possible,  for  example,  to  infer  from  suitable  partial  data 
the  first  few  spherical  moments,  or  other  similar  signature  parameters,  of  the  unknown  object.  At  present  we  have 
no  information  in  this  direction. 

Finally,  even  if  all  of  these  problems  could  somehow  be  resolved,  the  remaining  task  u.'  computing  the  shape  of 
the  object  from  the  scattering  data  is  in  general  nontrivial.  If,  for  example,  the  object  is  known  to  be  smooth  and 
convex,  and  the  exact  geometric  optics  backscatter  data  are  obtained  from  all  aspects,  the  problem  remains  of  com- 
puting the  shape  of  the  surface  from  a knowledge  of  these  data,  even  though  the  solution  is  known  to  exist.  The 
trouble  is  that  the  required  shape  is  obtained  as  the  solution  of  a nonlinear  elliptic  equation  for  which  the  usual 
computing  algorithms  fail  miserably.  The  errors  appear  to  accumulate  uncontrollably  if  local  approximations  are 
used.  In  spite  of  considerable  work  on  this  problem  (mostly  classified)  no  successful  computational  procedure  has 
btjn  reported.  [GISAT,  1965] 


discussion 


Newton:  I'm  a little  puzzled  by  your  statement  that  you  can  generalize  the  Jost-Kohn  procedure  to  three  dimensions. 

The  first  step  is  to  equate  the  scattering  amplitude  to  the  first  Born  approximation.  The  Born  term  depends  only 
on  the  difference  between  * and  * but  the  scattering  amplitude  will  not  It  will  depend  on  both  vectors  inde- 
pendently. I’m  not  sure  how  to  start. 

Prosser:  The  first  approximation  for  the  potential,  in  the  case  of  potential  scattering,  is  simply  given  by 

Vt{2  k)  = T(~k,k) 

u 

Chadan:  When  you  expand  the  T matrix  as  a Born  series,  it  doesn’t  converge  if  you  have  bound  states.  How  do 

you  know  if  you  have  bound  states  or  not? 

Prosser:  Characteristically,  the  radiation  problems  have  no  bound  states.  But  the  convergence  question  is  still  open. 

Chadan:  When  you  go  to  higher  terms,  those  integrals  are  principal  value  integrals. 

Prosser:  Yes,  but  one  treats  them  with  the  usual  quantum  mechanical  prescription,  by  assigning  to  the  denominator 

a small  positive  imaginary  part. 

Bojarski:  The  geometrical  optics  problem  and  the  Minkowski  problem  have  been  solved  successfully  by  Keller  and 

Lewis  for  a body  of  revolution  [ Keller  1959] . 

Kay:  Do  you  think  that  the  Jost-Kohn  method  might  be  related  to  a multipolar  expansion  for  the  body? 

Prosser:  It  looks  to  me  a little  more  like  this  functional  integration  procedure  we  heard  about  here. 

Unidentified  speaker:  Since  mention  has  been  made  of  the  papers  of  Faddeev,  and  since  they  represent  considerable 

progress  in  the  inverse  problem  they  should  be  referenced  here:  rklady  May,  September  1966.  Presumably 

Faddeev  has  new  results. 

Prosser:  The  suggestion  is  that  a combination  of  his  results  and  the  methods  of  Kay  and  Moses  might  well  make  the 

Gelfand- Levitan  algorithm  accessible  in  three  dimensions.  That  would  be  a tremendous  advance. 
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DIAGNOSTICS  OF  NONRADIALLY  STRATIFIED  MEDIA 


J.  Shmoys  and  J.  Pirraglia 
Polytechnic  Institute  of  Brooklyn 


INTRODUCTION 


With  a few  exceptions,  the  inverse  scattering  methods  have  been  applied  to  one-dimensional  distributions  only. 
Both  the  trajectory  approaches  of  the  Abel  and  Herglotz  type  and  the  wave  approaches  of  the  Gelfand-Levitan  type 
were  directed  to  plane,  cylindrically,  and  spherically  stratified  distributions. 

In  recent  years  a number  of  papers  [Motes,  1956;  Kay  and  Moses,  1961 ; Prosser,  1969]  dealt  with  the  applica- 
tion of  the  wave-type  methods  to  asymmetric  distributions.  The  difficulties  with  this  approach  are  twofold.  First, 
the  wave  methods  are  much  more  complicated  than  trajectory  methods.  Second,  the  Jost  and  Kohn  approach  used 
by  Moses  and  Prosser  is  based  on  a Neumann  (Bom)  series,  which  converges  slowly  if  the  distribution  to  be  diagnosed 
is  very  extensive  and  of  sufficient  magnitude  to  produce  large  phase  shifts. 

Here  we  investigate  an  admittedly  primitive  method,  of  the  trajectory  type,  to  distributions  lacking  spherical 
(c  , in  two  dimensions,  cylindrical)  symmetry.  We  assume  that  the  medium  is  “thin”  enough  so  that  the  ray  remains 
rectilinear.  The  data  to  be  used  consists  of  the  change  in  phase  of  a wave  propagated  along  a ray  due  to  the  presence 
of  the  medium,  for  a given  set  of  rays. 


N73:11614 


TWO-DIMENSIONAL  DISTRIBUTION 


Let  us  consider  the  problem  of  diagnosing 
a two-dimensional  refractive  index  distribution 
n(r,0).  The  set  of  probing  rays  includes  all  straight 
lines.  The  parameters  characterizing  a paiticular  ray 
are  the  impact  parameter  b and  ray  direction  a 
(fig.  1).  The  refractivity  of  the  medium  will  be 
assumed  expandable  in  a Fourier  series: 


n(r.$)  - 1 


^ Nk(r)eik6 


0) 


It  is  difficult  to  conceive  of  a physical  situation  in 
which  the  representation  (fig.  I)  would  not  exist. 
The  phase  shift  for  a ray  is  then 
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We  an  simplify  this  further  by  recognizing  that  the  integrand  in  Eg.  (3)  has  symmetry  sbout  rr/2: 


Vt  — — — 

* sin(n-0) 


gik  (jt-0)  |$in  (jt-0)|  ■■  e**ff  (sin  0)~ 


-fl/2 

,((b)  “ I byVJt”  !«<'•*  + (-1)*  «^J(*in  »"* 


The  term  in  brackets  reduces  to  2 cot  JkfJ  for  k even,  and  to  2 i sin  k0  for  k odd.  Hence  1^  is  either  pure  real 
or  pure  imaginary.  The  functions  >k(b)  are  obtainable  directly  from  the  observed  phase  shifts,  since  they  are 
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simply  the  coefficients  of  a Fourier  series  expansion  of  <!>(£, a): 


'*<*> 


*(b.a)e~‘ka  da 


(5) 


Equations  (4)  represent  then  a set  of  independent  integral  equations  for  the  coefficient  functions  N^r). 


SOLUTION  OF  THE  INTEGRAL  EQUATION 


It  is  easy  to  see  that  for  k-0,  Eq.  (4)  is  equivalent  to  the  usual  Abel  equation  for  a radially  stratified  medium. 
Similarly,  for  k=  the  integral  equation  is 


or 


2i 


sin  0 (sin  0)  2 d0 


1 /.(») 

2 r b 


sin  0)~2  d0 


(6) 


so  that  we  obtain  an  Abel  equation  for  JV(r)/r.  Unfortunately,  simple  reduction  to  an  Abel  equation  is  not  possi- 
ble for  * > 1.  We  have  not  been  able  to  invert  the  equation  explicitly.  However,  integral  Eqs.  (4)  can  be  inverted 
numerically.  To  ar.’ss  the  difficulty  of  doing  so,  we  tested  a procedure  that  could  be  used  for  all  it  on  special 
cases: 

1.  Select  a set  of  values  of  b for  which  the  data  will  be  matched:  b0,  bt,  bt, . . . , in  decreasing  order. 

2.  For  r>b0  assume  _V(r)  * ;V0  exp  [(b0  -r)/H\,  when  H is  chosen  arbitrarily. 

3.  Using  the  exponential  approximation  in  the  integral  equation,  evaluate  N0. 

4.  For  b0<r<bi,  assume  N(r)  » Ar+B. 

5.  From  the  requirements  that  (a)  the  function  N(r)  be  continuous  at  b0  and  (b)  the  linear  -exponential 
approximation  satisfy  the  integral  equation  at  r - bt , obtain  two  equations  for  the  two  unknowns  A 
and  B,  that  can  be  solved. 

6.  Proceed  in  a similar  w;  y with  the  piecewise  linear  approximation  to  the  smallest  value  of  b for  which 
data  are  available. 

Wc  have  inverted  the  phase  shifts  calculated  for  A^fr)  * e'r*  and  Nt (r)  ■ e~l*ir3  using  the  points  2, 1 .8, 1.6, 1 .4, 
1.2,1.  Three  values  of  H were  used  in  each  case:  0.5,1,  and  2-  The  results  are  shown  of  figures  2 and  3.  Wescethat 
in  spite  of  the  crude  piecewise  linear  model  and  sparsity  of  points,  acriracy  is  fair.  The  choice  of  H has  little  effect 
beyond  the  initial  point.  The  method  is  simple  enough  so  that  no  elaborate  programnrng  was  necessary.  Clearly  a 
denser  set  of  points  and/or  the  use  of  a piecewise  parabolic  model  with  continuous  derivatives  would  have  improved 
the  results. 
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Figure  1-  Results  of  numeric*!  inversion  for  k = 0. 


Another  procedure  that  could  be  used  relies  on  the 
expansion  of  N(r)  in  inverse  powers  of  r\ 


"<')=  £ (?) 

/ 


so  that 


(sin  <?y-J  df} [ft1  * 


If  /^(b)  can  be  represented  as  a series  in  inverse  powers 
of  b,  the  coefficients  • can  be  obtained-  Such  an 
expansion  could  be  obtained  by  observing  that  for  large 
b the  lowest  value  of  / dominates  the  behavior  of 
/(h);  after  that  term  is  subtracted,  the  next  higher  value 
dominates,  etc. 


/It  1 2 r 

: 


Figure  3.-  Results  of  numerical  inversion  for  * ■ 1. 


EXTENSION  TO  THREE-DIMENSIONAL 
DISTRIBUTIONS  - APPLICATIONS 


The  method  described  above  is  capable  of  inverting 
a two-dimensional  distribution  in  a given  plane.  Since  we 
are  assuming  that  rays  remain  linear,  a three-dimensional 
distribution  can  be  inverted  by  applying  the  two-dimen- 
sional procedure  in  a series  of  parallel  planes  (or  any  other 
convenient  set  of  planes  covering  the  region  of  interest 
sufficiently  densely). 

The  methods  outlined  here  could  be  used  for  diag- 
nostics of  planetary  atmospheres  provided  sufficient  exper- 
imental data  were  available.  Note  that  for  a three-dimen- 
sional distribution  we  would  need  a three-parameter  set  of 
data,  whereas  even  a single-parameter  family  is  hard  to 
obtain.  More  realistically,  perhaps,  the  method  could  be 
applied  to  millimeter  wave  (or  laser)  plasma  diagnostics 
where  we  often  encounter  rotating  cylindrical  (but  not 
cylindrically  symmetric)  distributions.  The  apparent 
rotation  is  caused  by  an  azimuthally  traveling  instability. 
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The  two- parameter  set  of  data  would  be  easier  to  obtain  in  this  case  since  the  angular  paremeter  a would  be 
equivalent  to  time-  This  substitution  is  possible  because  of  the  normally  rather  slow  rate  of  rotation  of  the 
plasma  distribution.  Hence  one  would  n*ed  only  a set  of  phase  shift  versus  time  curves  for  different  values  of 
the  impact  parameter. 
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DISCUSSION 


Kay:  You  mentioned  that  the  medium  has  to  be  weak.  It  seems  that  the  approximation  you  used  would  be  valid 

even  if  it  isn't  weak  as  long  as  it’s  slowly  varying. 

Shmoys;  No.  The  approximation  made  here  is  that  the  rays  are  straight. 


ELECTROMAGNETIC  INVERSE  SCATTERING* 
Norbert  N.  Bojarskit 


ABSTRACT 


N73-H615 


A three-dimensional  electromagnetic  inverse  scattering  identity,  based  on  the  physical  optics  approximation,  is 
developed  for  the  monostatic  scattered  far  field  cross  section  of  perfect  conductors.  Uniqueness  of  this  inverse  identity 
is  proven.  This  identity  requires  complete  scattering  information  for  all  frequencies  and  aspect  angles.  A nonsingular 
integral  equation  is  developed  for  the  arbitrary  case  of  incomplete  frequence  and/or  aspect  angle  scattering  information. 
A general  closed-form  solution  to  this  integral  equation  is  developed,  which  yields  the  shape  of  the  scatterer  from  such 
incomplete  information.  A specific  practical  radar  solution  is  presented.  The  resolution  of  this  solution  is  developed, 
yielding  short-pulse  target  resolution  radar  system  parameter  equations.  The  special  cases  of  two-  and  one-dimensional 
inverse  scattering  and  the  special  case  of  a priori  knowledge  of  scatterer  symmetry  are  treated  in  some  detail.  The  merits 
of  this  solution  over  the  conventional  radar  imaging  technique  are  discussed. 


INTRODUCTION 


The  direct  scattering  problem,  whether  electromagnetic,  acoustic,  particle,  or  quantum  mechanical,  is  defined  as 
the  problem  of  predicting  the  scattered  quantities,  given  the  incident  quantities,  the  relevant  description  of  the  scatterer, 
and  the  appropriate  laws  governing  the  interaction.  The  backscattering  and  forwardscattering  problems  are  the  special 
monoslatic  case  and  the  special  bistatic  case  of  scattering  in  the  direction  of  incidence,  respectively. 

The  inverse  scattering  problem  is  defined  as  the  problem  of  determining  the  relevant  quantities  describing  the 
scatterer,  given  the  incident  and  scattered  quantities,  and  the  appropriate  laws  governing  the  interaction.  The  unique- 
ness and  well-behavedness  problems  must  also  be  taken  as  an  integral  part  of  the  inverse  scattering  problem.  The  well- 
behavedness  problem  is  the  problem  of  determining  the  degree  of  smoothness  and  continuity  with  which  the  so-called 
“output  data”  vary  with  respect  to  the  so-called  “input  data."  The  uniqueness  problem  must  also  deal  with  the 
question  of  incomplete  input  data. 

The  electromagnetic  inverse  scattering  problem  is  thus  defmed  as  the  problem  of  determining  the  size,  shape  and 
electromagnetic  properties  distributions  (conductivity,  susceptibility,  and  permeability  distributions)  of  a scatterer, 
given  the  incident  and  scattered  electromagnetic  fields,  and  the  electromagnetic  field  equations  (Maxwell’s  equations 
and  the  appropriate  wave  equations;  the  constitutive  equations  and  their  coefficients  are  taken  as  part  of  the  electro- 
magnetic properties  of  the  scatterer);  and  the  determination  of  whether  this  problem  is  uniquely  solvable  for  incom- 
plete input  data  - the  various  permutations  of  incomplete  bistatic  aspect  angles,  incomplete  monostatic  aspect  angles, 
incomplete  frequency,  monochromatic  data  only,  incomplete  polarization  matrix,  amplitude  (power  cross-section) 
data  only,  and  scattered  far  field  data  only.  Also  of  interest  is  the  inverse  scattering  problem  for  which  there  is  some 
a priori  information  about  the  scatterer,  such  as  that  the  conductivity  of  the  scatterer  is  infinite  (perfect  scatterer)  or 
that  the  geometry  of  the  scatterer  is  of  axial  symmetry. 

This  paper  is  restricted  to  the  monostatic  far  field  special  case,  a restriction  [being characteristically]  inherent 
to  the  radar  application.  For  mathematical  reasons  that  will  become  evident,  the  further  special  case  of  a priori 


•This  research  was  supported  in  part  by  USAF  contracts  AF  30  (602)  3961  and  F 30602-68-C-0260. 
+ Physicist;  Research  Contractor  and  Consultant  to  the  Department  of  Defense 
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knowledge  of  the  scatterer  being  a perfect  conductor  is  treated.  Ftr  both  these  cases,  special  attention  is  given  to 
the  further  special  case  of  the  finite  three-dimensional  aperture,  the  case  for  which  monostatic  scattering  data  are 
available  for  a limited  and  incomplete  frequency  and  aspect  angle  domain.  From  a practical  short-pulse  target 
resolution  radar  point  of  view,  this  special  case  is  of  fundamental  and  primary  interest. 


THE  PHYSICAL  OPTICS  CROSS  SECTION 


Here  the  physical  optics  approximation  is  taken  as  the  basis  for  the  direct  scattering  theory,  and  its  validity  for. 
and  consistency  with,  short-pulse  target  resolution  radar  concepts  is  discussed  briefly. 

The  scattered  magnetic  field  Hs  in  terms  of  the  induced  (by  the  incident  field)  surface  current  density  K on 
the  surface  of  a perfect  conductor  is  given  by  [Stratton,  1941  ] : 

(1) 

S 

where  the  Green’s  function  $ and  its  gradient  are  giv  r by 


4 nr 


(2) 


= (3) 

r 

The  physical  optics  approximation  [Van  Bladel.  1964]  for  the  induced  surface  current  density  in  terms  of  the 
incident  magnetic  field  //*  is 

12 n X H‘,  on  the  “illuminated”  segment  of  s ^ 

0 , on  the  “shadow”  segment  of  s 

Thus,  by  Eqs.  (1),  (3),  and  (4),  the  scattered  magnetic  far  field  Hi,  in  terms  of  the  wave  number  propagation 
vector  k%  of  the  scattered  far  field,  is 


Hi  - -2fkIX 


I QnXH'ds 
V-n< 0 


(5) 


If  the  incident  field  //' 


is  taken  as  a plane  wave  of  the  form 


HUjeiki.x  (6) 

in  the  vicinity  of  the  scatterer,  where  ki  is  the  wave  number  propagation  vector  of  the  incident  field,  and  the  range 
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and  phase  normalized  (in  the  coordinate  system  in  which  the  scatterer  is  described)  scattered  far  field  is  taken  as 


Hi 


Set**-* 


(7) 


then  for  the  monostatic  case,  that  is, 


ks  = -*«'  a k 


(8) 


Eq.  (5)  reduces  with  the  aid  of  Eqs.  (2)  and  (8),  and  the  transversality  of  the  incident  field  to 


S 


U- 


*4, 


2> k‘xk-ds/ 


n X) 


(9) 


Consistent  with  the  conventional  definition  [Berkowitz , 1965]  of  the  power  cross  section  o and  the  field 
cross  section  p 


5 = p/ 


o = pp*=4itr1 


(10) 

00 


Eq.  (9)  yields  for  the  physical  optics  field  cross  section  the  well-known  expression  [Berkowitz,  1965] 


i = -t/  e~Jlik’x  Jfc-ds 


02) 


For  a short-pulse  target  resolution  radar  system  to  be  effective,  iu  pulse  length  must  be  short  compared  to  the 
target  size;  furthermore,  since  the  fractional  bandwidth  of  such  a pulse  is  limited  by  practical  considerations  to  much 
less  than  unity,  it  follows  that  the  largest  wavelength  in  the  spectrum  of  the  transmitted  pulse  must  be  very  short 
indeed  compared  to  the  target  size.  The  physical  optics  approximation  Eq.  (12)  is  thus  a good  model  for  die  direct 
scattering  theory  for  such  a short-pulse  target  resolution  radar  system.  For  a detailed  discussion  of  the  physical 
meaning  and  implications  of  the  physical  optics  approximation,  that  is,  its  being  a total  first  order  local  scattering 
theory,  consistent  with  short-pulse  radar  concepts,  see  Bojanki  [1968] . 


THE  INVERSE  SCATTERING  IDENTITY 


With  the  aid  of  the  physical  optics  approximation,  we  now  develop  a basic  inverse  scattering  identity  that  states 
that  the  characteristic  function  (in  three-dimensional  space)  of  a scatterer  is  related  to  the  field  cross-section  (in  three- 
dimensional  k-space)  by  a three-dimensional  Fourier  transform.  Uniqueness  of  the  solution  for  finite-sized  scatterers 
is  established. 
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Introducing  the  variable  k defined  as 


k — 2k 


03) 


yields  for  Eq.  (12) 


pOO  = 


jkfe 


e~tK-XK.is 

rK’n>o 


(14) 


Thus 


P*(-«) 


rkf- 


e~t*'XK‘ds 

rK'tt<  0 


(IS) 


and 


p(<0  + p*(-k)  = 


/e^^K-rfs  + -~/e 

yfin  J ^ v/TtT  / 
**•«><>  ■'«'« 


K -ds 

fc-nCo 


(16) 


e~iK'x  K‘d  s 


V7) 


Since  the  integrand  of  Eq.  (17)  is  continuous  and  differentiable  on  s and  in  v banded  by  i,  it  follows  .y 
Gauss’  theorem  that 


p(>0  + p*(-»t) 


'vi/' 


7.(K<,-1K-X)dv 


K 

\/4F. 


e-tK'X  dv 


(18) 


(19) 


Introducing  the  quantity  T(k)  defined  by 


K* 


(20) 


yields  for  Eq.  (19) 


r\x)=  fe~i*‘X  dv 
Jv 

Defining  the  characteristic  function  y(x)  of  the  scatterer  by 


7(*)s 


1 , x in  v 
0,  x not  in  v 


permits  the  reformulation  of  Eq.  (21)  as  the  three-dimensional  Fourier  integral 


m 


■£- 


x >(x)  d3x 


if  the  volume  v of  the  scatterer  is  finite,  then  by  Eq.  (22) 


5 


|T<x)J  d3x  = v<°° 


(21) 


(22) 


(23) 


(24) 


it  thus  follows  from  Eq.  (23)  that  for  finite-size  scatterers  the  three-dimensional  inverse  Fourier  transform  of  T(k) 
exists  uniquely. 


7(x) 


£ 


ei*-x  rXx)d3it 


(25) 


which,  with  the  aid  of  Eq.  (20),  can  be  reformulated  as 
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After  replacing  k by  -a  in  the  second  integral  of  Eq.  (26),  we  obtain 


Both  inverse  scattering  identities  Eqs.  (25)  and  (29)  clearly  require  complete  scattering  information;  namely, 
knowledge  of  p(k)  over  all  k space-  that  is,  all  frequencies  and  all  aspect  angles. 


THE  FINITE  APERTURE  INTECRAL  EQUATION 


In  this  section  we  develop  a general,  nonsingular,  inverse  scattering  integral  equation,  solutions  to  which  permit 
the  determination  of  an  appropriate  maximum  of  information  about  a scatterer  from  incomplete  scattering  data.  The 
use  of  the  physical  optics  approximation  is  further  justified. 


In  practice  p(x)  is  known  (measurable)  only  for  an  incomplete  finite  portion  of  the  complete  k space;  namely 
a k -space  aperture  consisting  of  a limited  (finite)  frequency  band  and  a limited  aspect  angles  band.  Furthermore, 

Eq.  (23)  is  valid  only  in  the  physical  optics  regime  (wavelength  short  compared  to  the  overall  size  of  the  scatterer), 
and  hence  Eq.  (25)  must  either  include  fictitious  (physical  optics  scattering  data  in  the  Rayleigh  regime,  which  is 
physically  not  realizable)  low-frequency  scattering  data  r(x),  or  no  such  data  at  all.  We  thus  turn  to  the  problem 
of  determining  what  can  be  deduced  about  a scatterer  [7(x)l  from  such  limited  high-frequency  finite  aperture  data. 

Let  A(x)  be  an  aperture  function  defined  as 

A(k)  = C(k)  W(k)  (30) 


where  C(k)  is  a characteristic  aperture  function  defined  as 


CM 


1 , for  k for  which  T(x)  is  known 
0,  for  k for  which  T(x)  is  unknown 


(31) 
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and  where  W(k)  is  any  appropriately  chosen  (in  general  nonzero)  aperture  weighting  function,  subject  to  the 
conditions 


/ 

•£»oo 


\A(K)\  dJK  < 


(-2) 


Thus,  if  the  k -space  volume  of  the  aperture  is  finite,  anc  the  aperture  weighting  function  is  appropriately 
chosen,  then  the  three-dimensional  inverse  Foarier  transforms  of  the  aperture  and  characteristic  aperture  functions 
exist  uniquely  . 


o(x)  = 


c(x)  = 


e«-*/t<K|dJic 


(2tr) 


L 


(33) 


(34) 


Thus,  by  Eq.  (25)  and  the  utree-dimensional  convolution  theorem  for  three-dimensional  Fourier  transforms,  it 
follows  that 


1 


«(*)•**)  = r(*M(«)  d3K 


(35) 


which  by  Eqs.  (30)  and  (31)  reduces  to 


a(x)*T(x)  » j<i K'X  r(x)  Jp(«)  d*x 


(36) 


where  T(k)  is  clearly  known  in  the  domain  of  integration  C ~ the  aperture.  The  right-hand  side  of  Eq.  (36)  can 
thus  be  taken  as  known,  say  the  known  function  ;(x): 


1 


f 


C 


(37) 


The  three-dimensional  inverse  scattering  problem  for  a finite  aperture  thus  r tC  .es  by  Eqs.  (36)  and  (37)  to  the 
three-dimensional  nonsingular  convolution  integral  equation  (a  Fredholm  integral  equation  of  the  first  kind) 


a(x)*T<x)«g(x) 


(38) 
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This  integral  equation  can  be  solved  numerically  by  a variety  of  existing  techniques  such  as  the  matrix  methods 
of  Ritz-Galerkin  [Hildebrand,  1965] , the  associated  least-squares  best  estimate  method  [Hildebrand,  1965] , and  the 
associated  moments  method  of  Harrington  [1968] ; the  eigenfunction  expression  method  of  Toraldo  Di  F ranch' 

[Wolf,  1970] ; leading  to  so-called  “super  resolution";  and  the  K-space  method  of  Bojanki  [1971  ] , which  ai»o  leads 
to  super  resolution.  Several  closed-form  solutions  of  Eq.  (38)  for  apertures  of  specific  geometry  have  been  obtained 
by  Lewis  [1969] ; an  alternate  closed-form  solution  of  Eq.  (38)  for  apertures  of  general  arbitrary  geometry  is  presented 
in  the  next  section. 


A SOLUTION  OF  THE  FINITE  APERTURE  INTEGRAL  EQUATION 


Here  we  develop  a closed-form  solution  to  the  integral  equation  of  the  preceding  section.  This  solution  is  valid 
for  any  arbitrarily  shaped  finite  x- space  aperture  (of  incomplete  frequency  and  aspect  angles  scattering  data).  The 
full  details  of  a practical  frequency  and  aspect  angles  band  limited  right  rectangular  quasi-conic  section  aperture  are 
then  developed  as  an  example. 

The  -elution  of  Eq.  (38)  for  y(x)  is  greatly  facilitated  by  the  special  properties  of  y(x)-a  priori  knowledge 
that  y(x)  is  » characteristic  function  of  the  form  Eq.  (22)-and  the  possible  judicious  choice  of  the  aperature  func- 
tion F(k): 


Let  the  x*  axis  be  chosen  as  passing  through  the 
(near)  center  of  the  aperture  A (fig.  1).  Next,  let  the 
aperture  function  W(k)  be  chosen  as 


W(k)*/K  j 


(39) 


It  thus  follows  from  Eqs.  (30),  (32),  (33),  and  (34),  and 
again  the  three-dimensional  convolution  theorem  for  three- 
dimensional  Fourier  transforms,  that 


«(x)-c(x)^(x1)6(x1)6'(x,)  (40) 


where  c(x)  is  known: 


•»*M 


(41) 


Figure  1.-  The  aperture  and  surface  coordinate  system 


It  thus  follows  from  Eq.  (38)  that 


c(x)*6(x,  )6(x,)6'(xj)*7(x)  **(*) 


(42) 


which  reduces  to 


c(x)e~L  «g(x) 
3*3 


(«) 
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Since  ?(x)  is  a characteristic  function  of  the  form  (22)— in  particular,  for  fixed  values  of  x , and  x , , y(x 3) 
is  a dual  step  function  in  x2  of  unity  magnitude  and  steps  at  the  lower  and  upper  surfaces,  say  Z"(x i ,x2)  and 
Z(x  i ,x  a ),  respectively,  of  the  scatterer  (fig.  1 )— it  follows  that 


=6[xJ-Z'(x,.x1)]-6[x1-Z(x,.xJ)]  (44) 


which  by  Eq.  (43)  yields 


c(x)*6(xj -Z'(x,  jr2)|  -c(x)*6[x, -Z(x,,x2)l  =*(*) 


(45) 


Examination  of  Eqs.  (20>,  (26),  and  (37),  and  symmetry  and  physical  considerations  (the  implications  of  Eqs.  (4) 
and  ( 12)| . thus  yields  for  the  upper  surface  Z(X|  ,x2 ) only 


c(x).6[x,  -Z(X| ,x2)] 


(46) 


provided 


(47) 


which  is  assured  by  Eq.  (39). 

By  the  implications  of  Eqs.  (4)  and  (12),  it  is  physically  reasonable  that  information  about  the  lower  surface 
Z'(x,  jtj ) should  only  be  obtainable  from  scattering  data  from  the  lower  image  aperture  ,4*(-k).  It  is  now  evident 
that  the  introduction  of  the  image  aperture  served  the  sole  purpose  of  a mathematical  artifice,  which  permitted  the 
application  of  Gauss*  theorem  to  Eq.  (12),  yielding  Eqs.  (25)  and  (29);  and  that  knowledge  of  scattering  data  in  this 
image  aperture  is  not  needed. 

The  three-dimensional  convolution  on  the  left-hand  side  of  Eq.  (46),  say  X(x),  reduces  to 


Thus 


X(x)  hc(x)*6(xj  -Z(x,,x2)l 


c(x,  -x!ai  ~XjJCj  -Z(x,>j)]  dx,'  dx2 


(48) 

(49) 


where  X(x)  is  a resolution  density  function  that  is  a measure  of  the  location  of  the  upper  surface  Z(x,  ,x2 ). 


(50) 
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That  X(x)  is  indeed  such  a resolution  density  function  can  best  be  visualized  by  considering  the  limiting  case  of  an 
infinite  aperture  function  A(k)  for  which  c(jc j = 6(x);  for  such  an  aperture, 


X(x) 


-x2')6[x3-Z(x,',Xi)]dx,'  dx,' 


(51) 


= 6(x,  -Z(x,,x,)) 


(52) 


by  Eq.  (49),  whereas  for  a practical  realistic  aperture  of  finite  x -space  extent,  the  spatial  extent  of  the  nonvanishing 
portion  of  c(x),  and  hence  the  nonvanishing  portion  of  X(x),  is  still  small  compared  to  the  size  of  the  scatterer. 

In  fact,  this  resolution  function  X(  ,<  determines  the  resolution  of  the  solution  Eq.  (50),  a resolution  that  can 
be  exceeded  only  by  the  super-resolut.on  method  mentioned  earher. 

A three-dimensional  density  plot  of  X(x)  thus  represents  the  smeared  geometrical  image  of  the  surface  of  the 
scatterer,  the  spatial  extent  of  the  smearing  clearly  being  the  spatial  extent  of  c(x)  - the  resolution. 

A best  estimate  of  Z(xi  ,x3)  can  alternatively  be  obtained  by  a variety  of  correlation  [between  Eqs.  (41 ) and 
(49)j  methods,  employing  Fourier  transform  theory. 


THE  RESOLUTION 


The  resolution  in  x space  is  clearly  the  spatial  extent  of  the  nonvanishing  size  of  X(x);  it  thus  follows  from 
Eqs.  (30),  (32),  and  (49)  that  the  resolution  in  any  one  direction  in  x space  is  the  reciprocal  of  the  k -spatial  extent  of 
the  (nonvanishing  portion  of)  the  aperture  <4(k)  in  that  one  direction. 

The  finite  aperture  inverse  scattering  solution  Eq.  (50)  can  clearly  be  reformulated  in  a variety  of  desired 
practical  (radar)  spherical  coordinate  systems.  For  the  particular  spherical  coordinate  system  shown  in  figure  2: 


/cos  $ sinf  \ 

x = x { sin  { I 

(53) 

\cos  { cos  f / 

t!  x3  dx  cos  f d{  df 

(54) 

the  inverse  scattering  solution  Eq.  (SO)  for  the  right  rectangula.  quasiconic  section  aperture  shown  becomes 


X(*)‘ 


/■-sXlo  Xo 

•it,  •'-*>  *^o 


cos*(xi  Sinf  + Xj  COS £)*(*,£{) cost  df  cos’jdfxdx 


(55) 


where 


1 A 

(56) 

I 

kj  -ito+yAx 

(57) 

6-43 


ft/  ' 


l ifure  2.-  The  mixed  coordinate  system 


where  k0  and  Ax  are  the  carrier  and  bandwidth  of  the 
transmitted  spectrum  respectively: 

Examination  of  Eqs.  (41 ) and  (49),  after  a similar 
reformulation  into  this  same  spherical  coordinate  system, 
readily  reveals  (after  small  angles  approximations)  the 
resolution  in  range  and  cross  ranges  to  be 


Ax,  = — AJ  = 2f0  (58) 

k0  Af 

Ax,=— 77  A|s2f0  (59) 

Ko  a? 


Ax, 


Ate 


(60) 


This  resolution  can  only  be  exceeded  by  the  earlier 
mentioned  techniques  of  super  resolution. 

Equations  (58),  (59),  and  (60)  are  thus  the  equations 
for  the  parameters  of  a target  resolution  radar  system. 


THE  SPECIAL  CASES 


The  special  cases  of  the  one-  and  two-dimensional  inverse  scattering  problem  (scattering  data  restricted  to  a 
x-space  line  or  plane  respectively,  obtained  from  a three-dimensional  scatterer)  can  be  treated  by  applying  the  methods 
of  Lewis  J1969J  or  Bojanki  (1967J  to  equation  (30)  et  seq. 

Namely,  by  choosing  the  characteristic  aperture  function  for  the  two-  and  one-dimensional  special  cases,  respec- 
tively, as 


C(k)  = C(<m,Kj)5(kj)  (61) 

C(k)  = C(k,)6(k,)6(kj;  (62) 

The  three-dimensional  convolution  integral  Eq.  (38)  reduces  to  the  two-  and  one-dimensional  integral  equations 

g(x,S2)  (63) 

and 

«(*i  )•«(*,)  * *(*i)  (64) 


6-44 


respectively,  where  0 is  the  thickness  distribution  function  in  the  x3  direction,  a is  the  area  distribution  function 
orthogonal  to  the  x,  direction  of  the  scatterer,  and  g(xj  ,x2)  and  $(xi ),  respectively,  reduce  to 


g(x  I 


(2»r)  Jc 


^■rJr 


g(? ,)-r-  h,K'Xi  rcitiW*,)*, 
Jc 


(65) 


(66) 


The  nonsingular  integral  equations  (63)  and  (64)  can  be  solved  for  0 and  a by  any  of  the  previously  discussed 
means. 

The  further  special  case  of  a priori  knowledge  of  the  scatterer  being  a surface  of  revolution  about  the  x 2 axis 
(fig.  2)  can  clearly  be  treated  by  the  preceding  two-dimensional  formulation  Eqs.  (63)  and  (65)  by  recognizing  that 
the  profile  function  (generatrix  of  revolution)  of  the  scatterer  is  1 / 20(0, x2  ),  thereby  further  simplifying  Eqs.  (63) 
and  (65)  after  the  appropriate  modifications.  A more  direct  treatment  of  the  problem  of  the  surface  of  revolution 
will  be  presented  subsequently. 

The  special  cases  of  the  aperture  A(k)  being  of  certain  given  geometrical  shapes  can  be  treated  by  applying  the 
method  of  Lewis  [1969]  to  equation  (30)  et  seq. 

The  special  cases  of  a priori  knowledge  of  the  scatterer  possessing  certain  geometrical  symmetries  can  be 
treated  by  applying  the  methods  of  Bofarski  [1967] . We  use  the  case  of  the  scatterer  known  to  be  a surface  of  rev- 
olution  about  the  x2  axis  (fig.  2)  as  anexample.  For  such  a scatterer,  the  monostatic  cross  section  clearly  is  inde- 
pendent of  the  longitudinal  aspect  angle  £, 


p(k,5.?)=p(M) 


(67) 


Furthermore,  the  profile  function  x2(x2)  of  such  a surface  of  revolution  is  given  by  the  function  describing  the 
upper  surface  Z(x,  ,x2)  at  the  plane  xt  =0.  Taking  the  limits  of  integration  over  £ in  Eq.  (50)  as  from  0 to 
2sr»  setting  x,  =0  in  Eq.  (50),  an  1.  using  the  integral  representation  of  the  Bessel  functions,  it  follows  that 


X(*i,*s) 


2^if  f elKX'  Sm  cos  *> P(*> Q 1 

* ‘1  H 


(68) 


which  for  small-angle  approximations  yields  the  doubly  truncated  two-dimensional  mixed  Hankel  (Fourier-Bessei)- 
Fourier  transform 


X(*i,*s) 


p(«,{)  d|  k dx 


(69) 
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As  expected,  only  two-dimensional  scattering  information  (in  k and  £;  i.e.,  in  frequency  and  one  aspect  angle, 
the  latitudinal  aspect  angle)  is  required  for  an  inverse  scattering  solution  by  Eq.  (68)  or  (69). 

Solutions  are  also  obtainable  for  the  various  combinations  of  the  special  cases  cited. 


CONCLUSIONS 


The  solutions  developed  in  this  paper  can  be  viewed  as  solutions  to  the  frequency-band-limited  and  aspect-angles- 
limited  short-pulse  synthetic  aperture  radar  imaging  (and  associated  data  processing)  problem.  The  solutions  presented 
are  based  on  rigorous  electromagnetic  scattering  and  inverse  scattering  theory  applicable  to  spatially  distributed  scat- 
tered. yielding  real  three-dimensional  geometrical  images;  in  contrast  with  conventional  two-dimensional  radar  imaging 
technique,  which  is  based  on  the  heuristic  approach  of  isolated  point  scattered  (scattering  centers)  concepts,  yielding 
so-called  “radar  images,”  or  maps  [ Rihaczek , 1969] , which  most  often  do  not  resemble  the  real  geometrical  images  of 
the  scatterer. 

The  merits  of  the  solutions  presented  in  this  paper  over  the  conventional  radar  imaging  technique  deserve  the 
following  iu'.ther  examination.  The  rather  unsatisfactory  results  of  the  conventional  radar  imaging  technique  arc 
essentially  the  consequence  of  the  heuristic  approach  to  the  problem:  The  approach  consists  of  considering  a spatially 
extended  target  as  a fictitious  ensemble  of  identifiable,  stationary,  noninteractive,  nondispersive,  and  isotropic  point 
scattered.  From  a rigorous  electromagnetic  scattering  point  of  view,  a spatially  extended  scatterer  is  not  an  ensemble 
ot  point  scattered,  nor  are  these  fictitious  point  scattered  in  principle  identifiable,  stationary,  noninteractive,  non- 
di-.persive,  and/or  isotropic.  (The  point  scattered  are  not  always  identifiable  by  virtue  of  the  so-called  registration 
problem;  i.e.,  that  the  point  scattered  can  be  continuously,  consistently,  and  correctly  identified  for  various  aspect 
angles.  The  point  scattered  are  not  always  stationary  due  to  the  fictitious  relocation  caused  by  changing  aspect 
angles.)  Furthermore,  this  technique  does  not  contain  basic  unique  existence  considerations,  and/or  rigorous  consid- 
erations of  the  problem  of  optimizing  the  results  for  incomplete  aspect  angles  and/or  frequency  information  availa- 
bility. The  attempts  to  convert  radar  images  to  geometrical  images  have  thus  failed  for  precisely  these  reasons. 

The  solutions  of  this  paper  alleviate  all  these  objections  to  the  so-called  radar  imaging  technique  by  the  regorous 
application  of  electromagnetic  inverse  scattering  theory,  based  on  rigorous  direct  scattering  theory  (and  not  based  on 
the  heuristic  model  of  a spatially  extended  scatterer  as  a fictitious  ensemble  of  identifiable,  stationary,  noninterac’.ive. 
nondispersive,  isotropic  point  scattered).  It,  therefore,  avoids  the  problem  of  the  conversion  of  radar  images  to  geo- 
metrical images,  by  sidestepping  and  avoiding  the  radar  image  altogether,  and  addressing  itself  to  the  problem  of 
generating  actual  geometrical  images  directly  from  radar  data,  including  unique  optimal  results  from  incomplete 
observation  aspect  angles  and  frequency  information. 
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DISCUSSION 


Unidentified  speaker:  If  your  set  C in  phase  space  is  bounded  in  some  high-frequency  shell  then  did  you  say  you 
could  get  a good  determination  of  the  body? 

Bojarski:  Yes.  If  the  extent  of  C is  such  that  its  transform  is  small  compared  to  the  body  size  it  will  map  it  out 

for  you.  The  resolution  in  x space  is  essentially  divided  by  its  size  in  k space. 

Kay:  Part  of  the  limitation  imposed  on  your  aperture  function  is  due  to  the  breakdown  of  physical  optics. 

Bojarski:  The  aperture  must  not  be  in  the  Rayleigh  or  resonance  regime. 

Unidentified  speaker:  Do  you  have  a criterion  for  what  limit  you  should  place  on  the  aperture  function  to 

guarantee  this? 

Bojarski:  Typically,  if  the  lowest  frequency  is  of  the  order  of  either  the  curvature  or  2 or  3 limes  the  Ka  number 

then  you  get  answers  within  S or  6%  without  noise. 

Moses:  Have  you  done  num:rical  modeling  to  check  this? 

Bojarski:  Yes.  We  took  the  Mie  series  solution  for  a sphere,  truncated  the  data  over  a 10%  bandwidth,  10/ca  num- 

ber, 1 / 10  rad,  to  see  if  we  could  reconstruct  the  sphere.  We  did  so  within  6%  of  the  profile  function. 
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A review  of  exact  techniques  for  determining  the  surface  of  a three-dimensional  perfectly  conducting  body  is 
given,  followed  by  some  new  results  on  the  uniqueness  question  concerning  the  number  of  measurements  that  may 
be  required  to  explicitly  determine  the  surface  of  the  body.  It  is  then  shown  that  the  nonhomogeneous  but  spher- 
ically symmetric  dielectric  electromagnetic  case  is  reducible  to  a scalar  inverse  rroblem  that  can  be  treated  by  known 
techniques. 


INTRODUCTION 


A review  of  exact  techniques  for  determining  the  surface  of  a three-dimensional  perfectly  conducting  body  is 
given.  This  is  followed  by  a discussion  on  the  uniqueness  problem  for  the  perfectly  conducting  case,  a subject  not 
well  understood  but  important,  since  it  delineates  the  necessary  set  of  measurements  that  may  have  to  be  made. 
With  regard  to  this  problem,  some  new  results  are  presented.  Finally  the  case  of  a spherically  symmetric  object  is 
treated-  that  of  a conducting  sphere  surrounded  by  a dielectric  that  is  nonhomogeneous  in  the  radical  direction. 

It  is  shown  that  this  vector  inverse  problem  can  be  reduced  to  a scalar  inverse  problem  which  car.  be  solved  by 
known  techniques. 


REVIEW  OF  THE  PERFECTLY  CONDUCTING  CASE 


For  a perfectly  conducting  three-dimensional  body,  the  inverse  scattering  problem  has  been  treated  using 
continuous  wave  information.  For  a fixed  direction  of  incidence,  polarization  and  frequency  of  thr  incident  wave, 
the  total  (scattered  plus  incident)  near  field  can  be  computed  from  knowledge  of  the  far  scattered  field  given  over 
the  complete  unit  sphere  of  directions.  Namely,  if  the  scattered  far  field  has  the  form  in  spherical  polar  coordinates, 

eikR 

Es  = E0 

R 


the  far  field  data  is  given  by  prescribing  Eo(0, 0)  as  a function  of  the  angular  variables  for  0 < 0 < ir,  0 < <t>  < 2ir. 

Theie  are  various  methods  of  obtaining  the  near  field  from  the  far  field  data.  Wilcox  [19S6]  gives  an  iterative 
procedure  for  determining  the  near  scattered  field  by  expressing  it  in  the  form 


E* 


eikR 

R 


OO 


I 


Eft 
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and  showing  that  E„(0, 0)  can  be  calculated  through  a set  of  recursive  relations  involving  partial  differential  operators 
in  the  variables  ( 0,0 ),  with  initial  data  given  by  prescribing  Eo(0, 0).  The  expansion  is  convergent  down  to  the  min- 
imum sphere  (with  center  the  origin)  enclosing  the  body.  However,  this  domain  of  convergence  can  be  extended  fur 
ther,  under  certain  conditions  as  discussed  below. 

Another  approach  is  to  use  the  plane  wave  expansion  where  the  near  scattered  field  at  a point  x , is  given  by 
the  integral 


-rr/i-joo  -nr 

E*  I I e*'  ‘ xEo(a,0)sinadad0 

0 0 

where  k'  = Jt  (sin  a cos  0,  sin  a sin  0,  cos  a).  Although  this  integral  involves  knowledge  of  the  far  field  data  Eo(0,0) 
for  complex  angles  of  0 , it  is  shown  [ Weston  et  al. , 1 968  ] , that  this  can  be  obtained  if  Eo(0, 0)  is  shown  only  for 
the  real  angles  for  the  complete  unit  sphere  of  directions.  The  above  expression  is  convergent  in  general  only  for  the 
half  plane  z > z0 , where  z = z0  is  the  tangent  plane  to  the  obstacle  such  that  the  obstacle  lies  in  the  half  space 
z <z0.  Again  for  certain  cases  discussed  below,  the  domain  of  convergence  may  greater,  namely  a half  space 
z >Zi , where  z1  <z0. 

Even  though  the  plane  representation  given  above  gives  the  near  scattered  field  in  the  half-space,  by  rotation  of 
coordinate  system,  or  by  using  other  forms  of  the  expansion  [rwmfcj',  1962] , the  near  field  can  be  obtained  by  sim- 
ilar representations  for  other  half  spaces;  hence  in  practice,  the  near  field  can  be  obtained  in  at  least  the  union  of  all 
half  spaces  exterior  to  the  body  - outside  the  minimum  convex  shape  enclosing  the  body.  Although  uie  plane  wave 
representation  is  mainly  useful  as  a theoretical  tool  to  prove  results,  it  is  extremely  useful  to  obtain  asymptotic  expres- 
sions for  the  near  field  in  the  case  of  high-frequency  scattering,  even  if  the  far  scattered  field  is  not  known  over  the 
complete  unit  sphere,  but  in  some  sector  such  as  the  cap  0 < 0 <,  0O . Stationary  phase  techniques  may  be  employed 
to  evaluate  the  integral,  and  it  is  shown  [Weston  and  Boemer,  1968]  that  this  is  equivalent  to  tracing  the  rays  (of 
geometric  optics)  back  to  the  surface.  Thi"  asymptotic  technique  of  using  the  plane  wave  representation  is  equivalent 
in  practice  to  the  analog  device  of  holography  where  the  phase  and  amplitude  is  measured  over  a sector  usually  in  the 
near  field. 

Another  approach  which  is  useful  numerically  is  to  expand  the  near  scattered  field  in  terms  of  the  spherical 
vector  wave  functions  [Stratton,  1941]  as  follows 


E*(x)  = 


aea,mnateo,mn  + ^eo,mnneo,roi 


The  coefficients  a and  b vomitting  subscripts)  are  obtained  by  matching  the  above  expression  to  the  far  field  data. 
This  involves  in  pr  ictice  inverting  a matrix  to  solve  a linear  system  of  equations.  Some  results  on  the  matrices  and 
determinants  involved  is  given  by  Boemer  [1970] . 

As  mentioned  above  the  various  expressions  for  the  near  scattered  field  converge  in  some  domain  exterior  to 
the  body.  However,  the  domain  of  convergence  of  the  expressions  may  be  extended  part  way  inside  the  body,  if  the 
surface  of  the  body  is  infinitely  differentiable.  The  domain  of  convergence  is  determined  by  the  singularities  of  the 
scattered  field.  As  an  example,  the  exact  expression  for  the  scattered  field  by  a perfectly  conducting  sphere  centered 
at  the  origin  (the  Mie  series)  converges  and  satisfies  the  free  space  Maxwell’s  equations  everywhere,  except  at  the  origin 
where  it  has  a singularity.  Physicially,  the  scattered  field  appears  to  rise  from  an  equivalent  source  at  the  origin  in  con- 
trast to  the  actual  physical  currents  induced  on  the  sphere. 


6-49 


The  location  of  the  domain  of  convergence  is  important  for  two  reasons:  (1)  to  know  when  the  scattered  field 
representations  may  exter A part  way  inside  the  body,  thus  providing  a means  to  determine  the  surface  from  the  total 
field;  and  (2)  from  a practical  standpoint,  when  a finite  set  of  measurements  is  made,  the  scattered  field  has  to  be 
approximated  by  a finite  sum,  and  the  knowledge  of  the  domain  of  the  convergence  is  important  for  estimating  the 
errors  in  the  scattered  field  in  the  vicinity  of  the  surface.  This  was  ignored  by  Bates  [1971  ] when  he  assumed  that 
the  absolute  value  of  the  sum  of  an  infinite  series  of  terms  of  order  e was  less  than  e,  where  e is  a small  parameter. 

This  leads  to  the  question  of  what  conditions  must  be  imposed  on  the  surface  for  the  singularities  to  lie  inside. 

It  is  well  known  that  the  total  field  is  singular  at  a discontinuity  like  an  edge.  From  physical  grounds,  one  must  ex- 
pect that  the  surface  be  infinitely  differentiable  for  the  singularities  to  be  inside.  This  was  shown  for  convex  por- 
tions of  the  body  in  Weston  etal.,  [1968] , and  an  algorithm  was  presented  that  yields  the  domain  of  convergence. 

The  results  of  the  algorithm  indicate  that  for  a prolate  spheroid,  the  singularities  lie  on  the  axis  between  the  focal 
points,  which  agrees  with  well-known  results  of  R.  F.  Millar  for  the  ellipfic  cylinder.  Alternative  procedures  (other 
than  by  the  exterior  representations  given  above)  must  be  used  to  obtain  the  scattered  field  in  cavity  portions  of  the 
body- that  is,  portions  exterior  to  the  body  but  inside  the  domain  of  convergence  of  the  various  exterior  representa- 
tions given  above,  Weston  et  al.,  [1968] . 

The  conducting  surface  is  found  by  searching  for  the  surfaces  for  which  the  tangential  components  of  the  tot"l 
field  vanish.  The  uniqueness  question  (namely  is  there  more  than  one  such  surface  for  which  this  is  true)  is  discussed 
in  the  next  section.  From  a numerical  standpoint,  the  procedure  is  to  look  for  the  surfaces  for  which  EX  E*  = 0, 
where  E*  is  the  complex  conjugate  of  E.  This  is  a necessary  but  not  sufficient  condition.  It  implies  that  the  dir- 
ection of  the  electric  field  is  real.  Another  necessary  condition  is  E • H = 0.  Hence,  once  surfaces  are  found  such  that 
E X E*  = 0 and  H • E = 0,  then  those  surfaces  for  which  nXE  = 0(n  the  unit  normal  to  the  surface)  can  be  easily 
picked  out. 


UNIQUENESS  FOR  PERFECTLY  CONDUCTING  BODIES 


We  now  consider  the  question  of  uniqueness  for  inverse  scattering  of  perfectly  conducting  bodies.  We  have 
pointed  out  that  for  a fixed  frequency,  direction,  and  polarization  of  incident  wave,  the  knowledge  of  the  complete 
scattered  far  field,  led  to  the  determination  of  the  total  field,  from  which  the  scattering  surface  is  obtained  by  locat- 
ing the  surface  for  which  tangential  E vanishes.  The  question  of  uniqueness  arises;  namely,  is  there  more  than  one 
such  surface  for  which  the  tangential  electric  field  vanishes,  if  more  than  one  such  surface  exists,  how  many  sepa- 
rate measurements  must  be  performed  to  evaluate  the  correct  surface.  By  separate  measurements,  we  mean  the 
observation  of  the  complete  scattered  feild  for  a different  incident  wave. 

We  shall  restrict  ourselves  to  scattering  bodies  contained  by  simply  connected  closed  smooth  \Leis  1967] 
surfaces.  The  smoothness  condition  is  not  that  restrictive  physically,  since  for  a finite  wavelength,  sharp  edges 
exhibit  the  same  characteristics  as  edges  rounded  with  radii  of  curvature  much  less  than  a wavelength. 

Let  S0  be  the  surface  of  the  scattering  body.  We  know  that  the  scattered  field  must  possess  singularities  which 
lie  inside  Sq.  Hence,  if  in  performing  the  inverse  problem,  we  find  surfaces  additional  to  So  such  that  the  tangential 
electric  field  vanishes,  we  can  eliminate  from  consideration,  those  closed  surfaces  that  do  not  contain  S0  ■ For  if  such  a 
closed  surface  should  lie  outside  S0  (but  not  contain  S0),  it  will  contain  no  singularities  in  its  interior,  and  hence 
cannot  represent  an  obstacle. 

Tb  .re  may  exist  more  than  one  surface  St , S2 , . - . , on  which  the  tangential  electric  field  vanishes,  and  which 
enclose  $ At  present,  however,  we  need  consider  only  the  smallest  surface  St  enclosing  So . If  the  scattering 
obstacle’s  surface  is  infinitely  differentiable  the  singularities  cf  the  scattered  field  lie  well  inside  the  surface,  and  sur- 
faces could  exist  inside  So  such  that  tangential  E vanishes.  However,  to  simplify  the  analysis,  we  will  consider  the 
case  where  the  additional  surface  St  is  outside  So-  (The  other  case  may  be  treated  in  the  same  way.) 
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The  existence  of  the  vr;iishing  of  tangential  E on  the  surfaces  Sq  and  S,  implies  that  the  operating 
frequency  is  at  an  eigenfre^uency  of  the  volume  V enclosed  by  the  surface;  i.e.,  there  exist  eigenfunctions 
(Em,  Hm)  of  Maxwell’?  equations.  In  the  volume  V,  the  total  field  is  thus  a linear  combination  of  these 
eigenfunctions. 


To  det». mine  the  set  of  measurements  required  to  eliminate  St  as  a candidate  for  the  surface  of  the  obstacle, 
he  eigenf  nctions  and  their  properties  will  be  considered  in  more  detail.  The  field  at  any  point  y in  V can 
Ik  represented  in  terms  of  surface  currents  as  follows 


where 


E(y)  -f j0~j  X 7^-— ^-(7- 


eikR  , . 

Ra^ 


0) 

(2) 


where  x is  the  variable  of  integration  over  the  surfaces  5 = So  + St . If  it  is  postulated  that  the  field  quantities 
satisfy  the  free  space  Maxwell’s  equations,  in  the  interior  of  S0  and  exterior  to  St , and  the  radiation  condition  at 
infinity,  then  j and  j’  are  related  to  the  discontinuity  in  the  tangential  components  of  E and  H across  the  surfaces 
So  and  St  ,i.e., 


j = i.X(H--H+) 
j'  = n X (E  - E+) 

where  n is  the  unit  outward  normal  (out  of  V)  to  the  surfaces,  and  H~  and  H*  are  the  respective  values  on  the 
interior  (with  respect  to  V)  and  exterior  of  the  surfaces. 

The  eigenfunctions  can  be  represented  by  setting  j * 0,  and  j'  = -2ft,  giving 


E(v)  = 2 / /t  X 70  d"x  ye  V 


(3) 


Let  y approach  the  surface  S(  from  the  interior,  then  the  above  equation  yields 


n X E(y)  = /t+  Jviy)  X (ft X 70)dox  yeS 


and  similarly  letting  y approach  the  surface  Sq  from  the  interior  of  V, 


i X E(y)  = ft  + |n(>')X(ftX70)daJt  .ye$> 
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For  (E,H)  to  be  an  eigenfunction,  nX  E(.y)  = 0,  hence  the  necessary  and  sufficient  conditions  for  E(y)  given  by 
Eq.  (3)  to  represent  the  modes  is  that  p must  satisfy  the  following  integral  equations 


p+  I n(>’)X  (/IX  70)dox  = O 

ye  So 

i3 

(t+  f n(.y)  X (fxX  V<(>)dax  - 0 

y eS, 

’S 


(4) 


The  integral  operators  associated  with  the  above  homogeneous  integral  equations  are  compact  operators  [Muller,  1969] , 
with  respect  to  the  Banach  space  of  continuous  vector-valued  functions  defined  on  the  smooth  surfaces.  Because  of 
this,  we  have  from  well-known  results  that  the  null  space  is  finite  - there  are  only  a finite  number  N of  independent 
eigenfunctions  (Er/t,Hm).  By  introducing  the  adjoint  operator  [Muller,  1969]  upper-bound  estimates  on  N can  be 
obtained. 

We  come  to  the  main  question:  How  many  additional  measurements  must  be  made  to  eliminate  S,  as  being 
the  scattering  surface?  To  try  to  eliminate  S{  as  a candidate,  take  another  set  of  measurements  (different  incident 
wave  but  same  frequency)  and  find  the  total  field  and  hence  those  surfaces  for  which  tangential  E vanishes.  Again, 
we  will  obtain  Sq  since  it  is  the  obstacle,  but  we  may  or  may  not  obtain  S, . If  we  don't  obtain  it,  S,  is  eliminated, 
but  if  we  do  happen  to  obtain  Sx  again,  we  will  have  to  repeat  the  process,  with  another  set  of  measurements.  Does 
this  repeated  process  terminate?  !t  will  be  shown  that  it  does 

Let  H*(m)  and  H (n)  represent  the  incident  and  total  magnetic  fields  for  the  nth  set  of  measurements.  It  will 
be  shown  that  if  at  least  N + 1 separate  sets  of  measurements  are  made,  then  it  is  impossible  for  tangential  E(n)  to 

vanish  on  St  for  all  N + 1 sets.  For  if  tangential  E(n)  vanishes  on  Sx  for  n = 1,2 W + 1,  then  we  have  the 

following  integral  equation 


n X H(n.i  * 2n  X H‘(n)  + 21  n X [(n  X H)  X V<p]x&ox  (5) 

*5. 


the  solution  of  which  exists,  and  is  unique  provided  that  the  frequency  is  not  an  eigenfrequency  of  the  volume  con- 
tained by  S| . Since  the  tangential  component  of  H is  continuous  across  St , n X H on  St  is  a linear  combination 
of  the  N eigenfunctions  of  the  volume  contained  by  So  and  Sx . This  means  that  the  set  «X  H(n), 
n = 1, 2, . . . ,/V+l,  is  a linearly  dependent  set,  and  from  the  integral  equation  (5)  that  n X Hfyi),  n * 1, 1,. . . ,2V+  1, 
must  be  a linearly  dependent  set.  This  is  a contradiction,  however,  since  the  incident  fields  are  independent  to  begin 
with.  Hence,  it  is  impossible  for  n X E to  vanish  on  S,  for  the  N + 1 sets  of  measurements  (at  the  same  frequency); 
thus,  at  most  N + 1 sets  of  measurements  need  to  taken  to  eliminate  S] . In  practice  one  may  need  to  take  only  a 
few  separate  measurements  (at  the  same  frequency). 

When  more  than  one  surface  S,  ,Sj , . . . in  addition  to  So  exists  for  which  the  tangential  components  of  the 
electric  field  vanishes,  the  eigenfunctions  associated  with  the  volume  enclosed  by  So  and  Sj , where  Sj  contains 
Si,  must  be  a subset  that  belongs  to  the  eigenfunctions  of  the  volume  enclosed  by  So  and  St  since  the  fields  are 
continuous  across  S, . Hence,  the  same  number  N + I of  measurements  is  needed  at  most  to  eliminate  Sj  as  a 
possible  surface  of  the  obstacle. 
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An  alternative  approach  [Weston  anlBoemer,  1969) , to  eliminate  the  extra  surface  is  to  change  the  frequency. 
This  is  based  on  the  result  that  the  eigenfrequencies  of  a fixed  finite  volume  are  a discrete  set.  Hence,  by  making  a 
set  of  measurements  at  a shifted  frequency,  St  will  no  longer  appear  as  the  result  of  the  new  measurement,  pro- 
vided that  the  frequency  has  not  beer,  shifted  to  a new  eigenfrequency.  Theoretically,  all  one  needs  is  the  set  of 
measurements  at  a frequency  and  at  an  infinitesimal  increase  in  the  frequency.  In  numerical  practice,  however,  the 
problem  of  eliminating  as  a contender  for  the  scattering  surface  may  not  be  accomplished  by  such  a slight  shift 
in  frequency,  since  the  effect  of  changing  frequency  may  be  to  shift  or  perturb  the  surface  S , . From  a numerical 
standpoint  the  shift  in  frequency  must  be  sufficiently  great  that  the  possible  shift  in  the  surface  S i is  large  enough 
to  detect  numerically,  taking  into  account  numerical  errors. 


INVERSE  SCATTERING  FOR  SPHERICALLY  SYMMETRIC  BODIES 


We  now  consider  the  case  where  the  body  is  not  only  perfectly  conducting  but  also  dielectric.  To  obtain  a 
solution  using  known  techniques  for  inverse  scattering,  we  consider  the  case  where  the  body  is  restricted  to  be 
spherically  symmetric,  in  which  case  the  relative  dielectric  constant  e will  be  taken  to  be  a real  function  of  the 
radial  coordinate  R . In  particular,  the  body  will  be  taken  to  be  a perfect  conducting  sphere  of  radius  R 0 , with 
dielectric  coating  of  outer  radius  R , . 

The  spherically  symmetric  assumption  allows  one  to  express  the  fields  in  terms  of  two  scalar  potentials  $ and 
X as  follows  with  time  dependence  exp(-t«r)  assumed  but  omitted.  The  spatial  dependence  of  the  electric  field  is 
given  by 


E = +/  V X (t//R)  +—  V X V X (Xv/eR) 
ke 


which  for  R>Rt  reduces  to 


E = +f  V X (*R)  +-7X7X  (xR) 
k 


(6) 


(7) 


where  k is  the  free  space  wave  number.  The  assymption  of  a perfectly  conducting  surface  at  R=R0  imposes  the 
following  boundary  conditions  on  \p  and  x 


t(R  o)  =0 


3 (RSex) 


a r 


JR  o 


(8) 


It  can  be  shown  that  the  scalar  potentials  must  satisfy  the  following  equations 


e*  = 0 


(9) 


V*X  + 


(10) 
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where 


p =^~  and  p = e_l/1  0 0 

d R 

For  the  inverse  scattering  problem,  we  would  like  to  determine  how  the  far  field  measurements  can  be  separated 
out  to  uncouple  their  dependence  on  the  two  potentials,  in  which  case  the  vector  inverse  problem  is  reduced  to  a scalar 
inverse  problem. 

Before  investigating  this,  we  need  to  consider  the  incident  field.  Because  of  spherical  symmetry  we  can  fix  the 
direction  of  the  incident  wave  to  lie  in  the  direction  of  the  positive  z axis,  and  polarized  in  the  x direction. 


E'  = ixe+ikz  (12) 

It  can  be  shown  that  this  can  be  split  up  into  two  scalar  potentials  ij/'cos  0 and  i^/'sin  <p,  where  <p  is  the  azimuthal 
angle,  as  follows. 


E*  = + / VX  sin  $>R)  +•“■  vx  V X (ifO  cos  0R)  0 3) 

with 

*'(0,/?)  = — ! L+J*z~L(i  + cos0>+'*K--(l  -cos0>-'**  04) 

kR  sin  0 1 2 2 J 

It  then  follows  that  the  scalar  potentials  corresponding  to  the  total  field  have  the  form 

\p  - ^(A,0)sin  <t> 

X =x(R,0)cm4> 

For  R>R,,  the  potentials  can  be  split  into  scattered  and  incident  parts 


4* = 4/i  + 


where  ^ is  given  by  Eq.(  14)  and  xf  K tlr. 
will  have  the  form  for  R — •», 


(15) 


X “x'  + x5 

fhe  scattered  components  will  satisfy  the  radiation  condition  and  thus 


PS(R  ft  ft)  ~ $ (16) 

X$(/l,0,d)~£~f(0)cosd  (17) 
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From  this  and  Eq.  (7),  it  can  be  shown  that  the  far  scattered  field  has  the  form 


_ gikR  r ^ ^-1 

B>~f-jy5-kflCos0ifl-£0Sin0iJ  (18) 


where 


= Jl+iL 

6 sin  6 be 


9 b6  sin  e 


(19) 


(20) 


We  now  come  to  the  problem  of  separating  out  the  components  / and  g from  far  field  measurements.  If  the 
far  field  scattering  pattern  is  measured  for  0 < 0 < n,  and  a fixed  angle  0 which  is  not  a multiple  of  n/2  (otherwise, 
we  would  need  two  fixed  values  of  0),  Eg  and  can  then  be  obtained  as  functions  of  6 . From  Eqs.  ( 1 9)  and  (20) 
we  obtain  the  two  first-order  differential  equations 

9 9 tint  V 


-***-—  <*-/)♦—  (t-f) 
6 9 sind*  " dd*  " 


which  on  integration  yield, 


(21) 


„ f'mcjsa r£,-f  w 

1*  sind  L 1 sind'  l*  9] 


»o  a, 


For  the  fields  to  be  nonsingular  on  the  axis,  the  following  must  hold 

/(d) ~ 0(d),  g~0(d)  ford-0 


and 


/(d)-O(a- d),  g-O(a-d)  ford-tr 
hence,  the  constants  of  integration  to  be  taken  will  be 

d0  "0  and  d,  ■ ir 


(22) 
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It  thus  fallows 


f. ta» 

i - cosd'  r 

r.  + r 1 

/ lint’ 

i -co$e  [ 

, _ f*  sin  6 l + cos0'  f.. 

f,L  ;ri‘»'£*r 


The  tar  Held  components  can  now  he  separated  out,  giving 


/<«) 


2 J sin  6 1 ? cos  9 [ 


(23) 


(24) 


(25) 


where  the  upper  and  lower  sign  is  taken  according  as  6 >6'  or  6'>0.  Hence  once  Eg  and  are  known  for 
0 < 0 <i  tr,  f(0 ) is  determined  from  Eq.  (25).  The  vector  inverse  scattering  problem  now  is  reducible  to  the  following 
scalar  inverse  problem. 

Given  that 


* 4t(R,0)i in* 


is  a solution  of  the  equation 


71  ^ + k2e(R)\li  - 0 


subject  to  the  boundary  condition  ^ * 0 at  R - Rq,  the  problem  is  to  determine  e(R)  from  the  knowledge  of  the 
far  scattered  field  f(6)  where 


c JkR 

sin* 

Ki\ 


for  an  incident  wave  tM  of  the  form  given  by  expression  Eq.  (14). 

By  expanding  the  above  solution  in  terms  of  the  associated  Legendre  function  P^( cos  0)  involving  the  polar 
variable,  the  problem  is  reduced  to  a ■ r.e-dimensional  inverse  problem  involving  the  radial  variable.  Hence,  with 
knowledge  of  the  complete  scattered  Held  for  all  frequencies,  e(R)  can  be  computed  using  the  Gelfand  and  Levitan 
approach  \Newton,  1970] . 

For  the  case  when  R0  -*  °»,  but  the  thickness  of  the  dielectric  layer  (R,  -Ro)  remains  finite,  and  the  measure- 
ments  of  the  scattered  *>w.  are  made  in  the  rear-zone  back  scattered  direction,  the  three-dimensional  inverse  problem 
reduces  to  the  one-din«n»onal  slab  problem,  which  has  been  treated  elsewhere. 
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This  introduces  the  intermediate  case  of  obtaining  a uniform  asymptotic  expansion  to  the  inverse  scattering 
problem  when  R -*  0.  If  a uniform  asymptotic  expansion  can  be  obtained,  without  imposing  stringent  conditions 
on  e(R),  then  correction  teuns  could  be  obtained  to  the  slab  approximat>on  containing  the  effects  of  curvcture. 
Present  ray-tracing  or  geometric  optics  techniques  do  this,  but  these  are  limited  to  the  high-frequency  case  where 
e(R)  must  be  slowly  varying. 


COMMENTS 


We  have  concentrated  on  the  exact  three-dimensional  electromagnetic  inverse  scattering  problem.  Although 
in  practice  the  measurements  required  for  the  exact  inverse  problem  appear  to  be  out  of  the  realm  of  practicality, 
the  understanding  of  the  exact  problem  gives  a better  insight  into  the  limitations  and  errors  when  approximate  or 
asymptotic  techniques  are  employed,  and  can  lead  to  further  refmeme..t  of  these  m-'hods. 
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DISCUSSION 


Kay:  Klaus  and  Muller  related  the  spherical  harmonic  expansion  of  the  scattered  field  to  the  sphere  of  minimum 

radius  that  would  enclose  the  singularities.  This  sounds  close  to  your  results.  He  did  a similar  thing  for  the 
electromagnetic  case. 

H'eston.  Did  he  relate  the  behavior  where  the  singularities  lie  with  respect  to  the  body  shape? 

Kay:  What  he  did  was  not . nsidered  a boundary-value  problem  as  such.  He  considered  the  far  field  thst  resulted 

from  sources  of  any  kind  and  related  the  coefficients  of  that  spherical  harmonic  expansion  to  the  minimum 
enclosing  sphere.  The  relationship  to  your  problem  has  to  do  with  the  nature  of  the  induced  souices. 

Twersky:  Bob  Miller  at  NYU,  showed  for  the  two-dimensional  problem  that  the  corresponding  expansions  for  the 

circular  cylinder  the  Hankei  functions  converge  down  to  the  axis  and  for  the  elliptic  cylinder  converge  down 
to  the  focal  line. 

Unidentified  speaker:  Have  you  tried  this?  There  seems  to  be  a stability  problem.  You  have  the  function  only 

approximately  at  infinity,  how  could  determine  the  body? 

Weston:  The  big  problem  is  to  get  input  data.  It  has  been  tried  for  the  spheroid  and  gave  better  results  than  I 

expected. 
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7.  SEISMOLOGY 


This  chapter  is  concerned  primarily  with  the  earth’s  interior,  although  several  of  he  techniques  discussed  are 
finding  application  in  space  science  studies  of  other  solar  system  bodies  such  as  the  Moon.  The  geophysical  inverse 
problem  is  concerned  with  the  inversion  of  seisnnc  data  to  obtain  profiles  of  parameters  which  describe  the  earth’s 
interior.  The  techniques  described  here  have  also  found  application  in  radio-occultation  studies  (Chapter  3),  and 
certain  of  the  techniques  rely  on  the  mathematical  algorithms  developed  in  the  particle  scattering  (Cl«apter  5)  and 
electromagnetic  scattering  (Chapter  6)  disciplines. 

R.  A.  Phinney  organized  and  chaired  the  session  devoted  to  seismology.  The  paper  by  Gilbert  is  a 
transcription,  edited  by  Dr.  Phinney,  of  the  oral  presentation. 
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ABSTRACT  N73- 


The  analytic  solution  to  the  linear  inverse  piiblem  provides  estimates  of  the  uncertainty  of  the  solution  in 
terms  of  standard  deviations  of  corrections  to  a j articular  solution,  resolution  of  parameter  adjustments,  and  inhuma- 
tion distribution  among  the  observations.  Monte  Ca.  lo  inversion,  when  properly  executed,  can  provide  all  the  same 
kinds  of  information  for  nonlinear  problems,  "roper  execution  requires  a relatively  uniform  sampling  of  all  possible 
models.  The  expense  of  performing  Monte  C urio  invt  rsion  generally  requires  strategies  to  improve  the  probability 
of  finding  passing  models.  Such  strategies  can  lead  to  a very  strong  bias  in  the  distribution  of  models  examined 
unless  great  care  is  taken  in  their  application. 


INTRODUCTION 


Here  we  examine  the  general  inverse  problem  from  the  point  of  view  of  Monte  Carlo  inversion  (MCI).  First 
we  describe  the  framework  of  simple  linear  inverse  pt  iblems  to  establish  the  nomenclature  and  purpose  of  MCI. 

Next  we  will  see  that  MCI  is  a natural  alternative  to  the  malytic  inverse  (AI).  After  a discussion  of  MCI  techniques 
and  difficulties,  we  present  a short  discussion  of  the  applicability  of  MCI  for  determining  the  structure  of  the  earth. 

This  paper  is  devoted  to  a review  of  the  principles  of  MCI  and  does  not  include  specific  descriptions  of 
applications.  MCI  has  been  applied  to  various  types  of  seismic  data  by  several  authors.  Keilis-Borok  and 
Yanovskaya  [1967]  and  Wiggins  [1969]  have  inverted  body  wave  observations.  Press  and  deihler  [1964] , Press 
[1968, 1970c, 6] , and  V.  Keilis-Borok  aid  L.  Knopoff  [personal  communication,  1968]  have  inverted  various 
combinations  of  body  wave,  surface  wave  and  gravity  data.  In  addition,  Anderssen  [1970]  has  used  MCI  for 
estimating  the  conductivity  structure  of  the  earth. 

Let  us  consider  the  class  of  inverse  problems  dealing  with  the  earth.  We  assume  that  we  have  measured 
m gross  earth  data  (GED)  g'j,jm\, ...  ,m.  Backus  and  Gilbert  [1967]  define  gross  earth  data  as  any  of  the 
possible  observations  that  may  be  relevant  to  a particular  solution  of  the  general  inverse  problem  of  the  earth.  We 
also  assume  that  we  can  represent  the  structure  of  the  earth  by  n discrete  parameters  Pf,  f»l,...,n.  Any 
inverse  problem  depends  then  on  exploiting  an  analytical  (computational)  relationship  between  the  model  parameters 
and  the  expected  GED 


gjsfj(Pi)  Jm  1 m 

The  object  is  to  find  set(s)  of  solution  parameters  p)  that  will  minimize  the  difference  between  the  observed  and 
computed  GED  gj-gj . 

If  the  function  fjipi)  is  linear,  we  can  find  a completely  analytic  description  of  the  class  of  satisfactory 
models,  the  resolution  of  the  parameters  and  the  information  distribution  among  the  gross  earth  data.  If  the  function 
is  not  linear,  then  we  an  faced  with  the  choice  of  either  linearizing  //  in  the  neighborhood  of  a particular  solution 
or  resorting  to  alternate  procedures  such  as  MCI. 
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ANALYTIC  INVERSE  OF  LINEAR  PROBLEMS 


The  treatment  of  the  AI  as  expressed  here  is  a condensation  of  a much  more  extensive  survey  [ Wiggins, 
1972] . Other  excellent  treatments  of  AI  has  been  stated  by  Backus  and  Gilbert  [1967, 1968, 1970]  and  in  other 
papers  in  this  volume. 

Let  us  minimize  the  difference  between  gj  and  gj  by  expanding  the  functional  relationship  in  a Taylor's 

series 

n 

V 

APi  1 

i-i 

whcie 

&gj  ~ gj-gj 
APi  = p/-p, 

We  nov.  have  a set  of  m simultaneous  equations  to  solve  for  the  parameters  A p,- . Clearly,  one  must  be  able  to 
compute  both  the  function  /.  (p,  ) and  the  partial  derivatives  or  variational  parameters  dfjldpj  in  order  to  effect 
a solution. 

Now  consider  the  matrix  notation 


where  the  weights  Sj  * o/op  are  proportional  to  the  reciprocal  of  the  standard  deviations  of  the  GED  and  the 
weights  w/  are  proportional  to  layer  thicknesses  or  volumes.  If  we  parameterize  the  model  by  dividing  it  into  a 
set  of  layers  or  volumes  and  assigning  an  average  value  to  each  parameter,  then  this  specification  of  c of  makes  the 
length  of  the  vectors  A/ W-,/J  invariant  to  the  thicknesses  or  volumes  selected  for  parameterization. 

We  can  now  write  the  simultaneous  equations  as 

m|YJ  [a/5]  " = pejm 

Among  these  m equations,  we  will  find  some  k independent  relationships  between  A P and  A G where  generally 
k «n,m.  Note  that  A has  been  defined  so  that  indices  along  rows  of  A correspond  to  parameters  and  indices 
along  columns  of  A correspond  to  GED. 
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Lancsoz  (1961]  has  shown  that  the  matrix  AW-**  can  be  factored  as 


r » i r*i 

AW~*  - m U 

A 

v 

L J L 

U J 

where  U are  column  eigenvectors,  A is  a diagonal  matrix  of  eigenvalues,  and  V are  row  eigenvectors: 

AW“wVT  - UA  W"^ArU  - VtA 
(Note  that  the  V matrix  here  is  the  transpose  of  Lancsoz'  V matrix.) 

The  eigenvectors  Kj  have  a simple  geometric  interpretation.  The  vector  K,  is  that  vector  whose  dot  'toduct 
with  the  rows  of  AW-**  is  maximal.  That  is,  V,  W-*4ATAW-*4K,T  is  a maximum  subject  to  the  constra.  t that 
Vt  T ■ 1 . Thus  V \ is  a geometrical  average  of  the  rows  of  AW-**  . If  we  remove  the  component  of  the  Ay- 
parallel  to  Vi  , then  Vj  is  a vector  that  is  most  nearly  parallel  to  the  reminder,  and  so  forth. 

We  can  think  of  the  V(  as  a reparameterization  of  the  inverse  problem.  Each  Vj  represents  a particular  linear 
combination  of  the  oid  parameters.  We  define  a new  parameter  vector  of  length  k 

AP*  • VW^A P 


and,  since  the  V/  are  orthonormai 


AP  - W-)4VTA P* 

Substituting  A ■ UAVW^  and  VW^AP'  AP*  into  the  simultaneous  equations  KAPmAG  we  find 

UAA/>*  * AG 


which  has  a least-squares  solution 


AP*  - A"‘UTAG 

AP  - W-,/,VTA-,UTAG 


Since  we  have  weighted  the  simultaneous  equations  by  the  reciprocal  of  the  GED  standard  deviations,  we  can  apply 
the  theory  of  linear  least  squares  estimates  [see,  e.g  ..Johnson,  1963,  p.  106-1 10]  to  find  the  standard  deviations 
of  the  parameter  adjustments 


Generally,  there  are  not  k nonzero  eigenvalues  and  n-k  zero  eigenvalues  for  the  matrix  AW-1'2  . Rather, 
the  eigenvalues  approach  zero  asymptotically.  Thus  we  must  choose  some  k based  on  the  behavior  of  the  standard 
deviations  ay  . That  is,  we  select  an  upper  bound  for  the  parameter  standard  deviations  and  then  select  k such 
that  the  solution  standard  deviations  are  not  greater  than  that  bound. 

Now  let  us  examine  briefly  what  we  can  say  about  the  solution  of  the  inverse  problem  based  on  the  ab  we 
development.  We  have  determined  a set  of  k adjustments  to  add  to  the  model  based  on  the  eigenvectors  Vj.  The 
size  and  standard  deviations  of  the  adjustments  are  determined  by  the  GED  and  their  standard  deviations.  There  are 

a set  of  n-k  free  vectors  VoiJ*  1 n~k , which  are  orthogonal  to  the  Vj  and  whose  coefficents  are 

completely  undetermined.  In  the  linear  problem,  we  can  add  any  amount  of  any  of  the  vectors  V0l  to  the  model 
Pi  without  affecting  the  quality  of  the  fit  at  all. 

Each  of  the  k adjustments  arc  determined  by  a linear  combination  of  the  gross  earth  data.  The  particular 
combinations  are  given  by  the  column  eigenvectors  Uj.  That  is,  Ux  gives  the  explicit  linear  combination  of 
gross  earth  data  that  determines  the  coefficient  for  the  eigenvector  Vx . Thus  we  have  also  determined  a set  of 
k linear  relationships  between  *hc  GED.  If  any  data  value  falls  significantly  outside  these  linear  relationships,  then 
our  model  is  improperly  specified  or  ihe  observation  is  in  error. 

The  eigenvector:  Vj  and  l/j  can  be  further  manipulated  to  estimate  the  resolving  power  and  information 
distribution  of  the  p-oblem.  First  consider  the  k parameter  eigenvectors  Vj.  The  resolving  power  of  a set  of 
GED  is  determined  by  finding  the  smallest  sets  of  adjacent  parameters,  the  average  value  of  which  can  be 
determined  uniquely.  The  thicknesses  of  the  corresponding  sets  of  layers  are  said  to  be  the  resolved  thicknesses. 
Clearly,  since  the  k eigenvectors  Vj  determine  the  total  linear  relationship  between  the  parameters,  we  must 
explore  the  resolution  properties  by  considering  linear  combinations  of  the  Vj . There  are  a number  of  ways  of 
finding  particular  combinations  of  the  Vj  that  maximize  a set  of  adjacent  parameters  while  minimizing  all  the 
others.  A consideration  of  all  such  combinations  will  then  map  the  resolution  distribution  in  parameter  space.  We 
can  make  similar  manipulations  on  the  eigenvectors  Uj  to  determine  adjacent  sets  of  GED  that  provide  equivalent 
information.  All  these  types  of  analysis  should  be  performed  for  a complete  description  of  iny  particular  linear 
inverse  problem. 


MONTE  CARLO  INVERSE 


Suppose  that  we  have  a particular  solution  of  an  AI  problem  P0  * [p/  . . . pn\  together  with  the 
eigenvectors  Vj  and  their  standard  deviations  o*  as  determined  in  the  last  section.  Further,  let  us  assign  (large) 

standard  deviations  to  the  free  vectors  Voi-  Let  us  then  generate  a set  of  random  models  /*/,/=  1 /, 

by  adding  rardom  amounts  of  each  of  the  vectors  Vj  and  V0j  tc  the  particular  solution.  The  random  coefficient 
for  adding  each  vector  will  be  constrained  to  have  zero  mean  and  a standard  deviation  of  o*  or  o*t- 
respectively. 

The  object  of  this  exercise  is  to  realize  that  we  can  recover  the  eigenvectors  Vj  and  Voi  and  their  standard 
deviations  a*  and  o£j  from  the  models.  We  simply  form  the  model  matrix 


If  the  standard  deviations  were  distinct  values,  then  the  eigenvalues  of  this  matrix  will  also  be  distinct  and  will  be 
proportional  to  the  standard  deviations  o*  and  o£j  (arranged  by  descending  numerical  size)  and  the  eigenvectors 
will  be  equal  to  the  corresponding  vectors  Vj  and  V0j , Note  that  here  the  best  determined  vectors  have  the 
largest  strndard  deviations  as  opposed  to  the  analysis  of  the  A matrix  in  the  last  section. 
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If  for  each  of  the  models  P(  we  had  computed  a vector  of  grou  earth  data  G/ * [/,  (Pi ) ■ . . fm(P i )]  we 
could  alto  recover  the  iff  vectors  from  the  row  eigenvectors  of  the  matrix 


Ci  -C0 
GR~G  o 


Hem  . we  see  that  given  an  adequate  sampling  of  random  models,  ail  of  which  are  solutions  of  the  inverse  problem, 
wr  can  estimate  the  sime  quantities  that  were  determined  by  the  analytic  inverse:  an  average  solution,  the  standard 
de  nation  of  the  parameter  estimates,  the  resolving  power  in  parameter  space  and  the  information  distribution  among 
the  GED.  In  addition,  we  may  be  able  to  establish  absolute  bounds  on  the  range  of  parameters. 

In  principle  then,  MCI  is  completely  equivalent  to  Al.  Both  approaches  have  basic  difficulties.  At  requires 
the  ability  to  compute  the  partial  derivatives  bfj!  dp,  and  that  these  derivatives  remain  nearly  constant  throughout 
the  model  space.  MCI  can  completely  avoid  the  problem  of  computing  the  derivatives.  Thus  the  linearity  of  the 
derivatives  is  of  little  or  no  consequence  to  the  solution.  We  must,  howevei,  obtain  an  adequate  sampling  of  the 
model  space  and  therein  lies  the  difficulty.  Before  discussing  these  difficulties,  let  us  consider  a formal  statement  of 
the  MCI  procedure. 

In  MCI  we  begin  by  establishing  a space  f of  possible  models  and  seek  to  find  the  subspace  f#  of  allowable 
models.  We  will  consider  here  only  models  which  are  specified  by  a set  of  n parameters  p/.  Generally,  the  bounds 
of  this  space  are  defined  by  upper  and  lower  limits  on  the  values  of  the  p,- ; however,  much  more  complicated  limits 
can  be  established.  In  the  simplest  implementation  of  MCI  the  search  consists  of  the  following  steps: 

1 . Generate  a random  model  Pr  in  the  space  f . 

2.  Compute  the  corresponding  GED  Gr 

3.  If  the  differences  between  the  computed  and  observed  GED  are  less  than  some  fixed  multiple  c f the 
standard  deviations  of  the  GED,  then  add  the  model  Pr  to  a list  of  successful  models. 

4.  Return  to  step  1. 

The  direct  application  of  the  algorithm  just  described  to  most  real  inverse  problems  is  prohibitively  expensive. 

The  ratio  of  passing  models  to  rejected  models  is  usually  very  small,  and  the  cost  of  computing  the  corresponding 
GED  is  often  significant.  Most  researchers  attempt  to  solve  this  problem  by  increasing  the  number  of  constraints  on 
the  model  generation  procedure  or  by  factoring  the  problem  into  sequential  segments.  If  certain  GED  are  affected  by 
only  a small  segment  of  the  mod'  ’ , then  that  segment  is  regenerated  until  all  the  GED  involved  are  satisfied.  The  search 
then  proceeds  to  a new  segment,  which  again  is  regenerated  until  the  new  GED  involved  are  satisfied,  and  so  forth.  This 
procedure  greatly  facilitates  finding  models  but  may  skew  the  distribution  of  models  radically. 

Let  us  consider  a simple  example.  Our  model  will  consist  of  eight  parameters  P(,i*  1 8,  with  the  con- 
straints that  p(  * 1 8.  Thus  the  parameter  space  P can  be  represented  by  the  squares  in  figure  1 . The  only 

constraint  that  we  place  on  the  passing  models  is  that  the  parameters  be  monotonically  increasing:  p/>pj_j . This 
restriction  can  be  considered  either  a further  constraint  on  the  model  generation  or  a set  of  GED.  if  we  take  the 
latter  approach,  then  the  MCI  consists  of  generating  sets  of  eight  uniformly  distributed  (pseudo)  random  integers 
lying  between  1 and  8.  We  then  check  to  see  if  these  numbers  are  monotonically  increasing.  Clearly,  the  probability 
of  finding  such  models  is  very  small  (approximately  0.003).  After  much  computation,  we  would  find  that  the  proba- 
bility distribution  for  the  passing  models  would  be  as  is  shown  in  figure  1(a)  [Wiggins,  I9o9J.  Most  of  the  models 
occur  near  the  line  p/  ■ /,  and  very  few  occur  near  the  opposite  comers.  This  illustrates  the  greatest  weakness  of 
MCI:  the  probable  distribution  of  ptssiiig  models  may  have  little  or  no  relationship  to  the  total  range  of  possible 
models  or  to  those  models  which  may  be  physically  meaningful.  To  fully  explore  the  .pace  of  possible  models  for 
this  example,  we  would  need  to  examine  more  than  75,000  models! 
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Of  course,  we  can  improve  the  efficiency  of  the 
search  by  a factor  of  "•  300  simpiy  by  considering  the 
monotomc  criterion  as  a constraint  on  the  generation  of 
the  models  rather  than  as  GED.  That  is,  we  will  only  gen- 
eiate  models  for  which  p,-  >p,_| . After  the  generation 
of  each  p, . we  will  consider  only  values  of  pi+j  which 
are  >p,'.  If  we  use  a uniform  probability  distribution 
for  generating  each  p,\  the  distribution  of  passing  models 
will  be  as  shown  in  figure  1(b).  Nearly  all  the  models  lie 
near  the  upper  limit  of  the  range  of  possible  models.  This 
distribution  has  even  less  relationship  to  the  full  range  of 
possible  models  than  did  the  distribution  in  figure  1(a) 

The  skewing  of  the  model  distribution  illustrated  in  this 
example  is  a necessary  consequence  of  segmenting  the 
search  for  passing  models.  Segmenting  can  certainly 
vastly  improve  the  probability  of  finding  passing  models, 
but  the  distribution  of  models  found  may  be  very  dif- 
ferent from  what  would  be  generated  without  segmenta- 
tion. The  degree  of  skewing  depends  on  the  number  of  seg- 
ments and  their  interdependence.  In  this  example,  each 
parameter  represents  a separate  segment  and  the  segments 
are  strongly  interdependent.  Since  we  know  the  solution 
distribution  for  this  example,  we  know  how  to  alter  the 
probability  distribution  for  generating  each  p,  so  that  we 
can  obtain  a set  of  models  with  the  distribution  shown  in 
figure  1(a)  | Wiggins.  19^9 J.  In  general,  however,  the 
purpose  of  MCI  is  to  investigate  the  distribution  of  models, 
and  therefore  we  do  not  have  such  preknowledge  and 
cannot  compensate  for  the  effects  of  segmentation. 

This  discussion  can  best  be  summarized  as  follows 
The  distribution  of  models  examined  is  strongly  affected 
by  the  parameterization  and  constraints  imposed  or  ie 
generation  of  the  models.  A successful  MCI  procedure 
must  be  capable  of  generating  a large  proportion  of  models 
that  span  the  range  of  successfu'  models  with  a relatively 
uniform  probability  distribution.  Such  a desideratum  can 
seldom  be  realized  for  models  that  require  a large  number 
of  parameters  for  their  specification 

Kfitis-Buruk  and  Knopuff  (persona!  communicaticiij 
have  proposed  an  alternative  MCI  technique  cailed  “hedge- 
hog" that  avoids  some  of  the  problems  of  mode'  distribu- 
tion. In  their  scheme  each  parameter  is  allowed  to  have 
discrete  valu  rs  only.  They  perform  a random  search  until 
a passing  model  is  found.  Then  they  change  the  search 
strategy  to  that  of  examining  all  nearest  neighbors  to  pass- 
ing models  until  they  have  explored  the  entiic  space  of 
passing  models.  This  approach  fo’lows  from  their  belief 
ihat  one  must  have  all  possible  passing  models  available 
before  one  can  adequately  describe  the  statistics  of  the 
inversion. 


PARAMETER  NO. 


Figure  1.  Probabilit;  distributions  of  random  model'  which 
consist  of  eight  parameu  bach  parameter  p,-  is  restricted 
lc  discrete  integer  valt-c . oetween  1 and  8 such  that  p,  > 
Pj_ ] . The  upper  portion  sho»";  '*..;tribut:ss  fo<  a 
random  sampling  of  all  possible  models.  The  lower 
. -rtion  show',  a biased  distribution  pioduccd  by  improper 
sampling. 
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APPLICATIONS 

Seven?  ears  ago,  the  rule  of  thumb  for  inverting  GED  ■ ?s  that  the  variational  parameters  for  surface  waves 
and  free  oscillations  were  sufficiently  linear  so  that  A1  could  be  used,  and  that  the  variational  parameters  for  body 
waves  o re  so  twilnear  that  MCI  must  be  used.  This  picture  is  less  clear  today.  The  rang;  of  models  for  the  upper 
mantle  is  suffice  Jy  Urge  that  nc!  all  these  models  can  be  derived  by  line:,  perturbations  from  some  “average" 
model.  On  tie  otic  hand,  as  shown  by  Jordsn  and  Johnson  and  Gilbert  later  in  this  chapter,  A1  can  be  used 
for  body  waves,  >t  least  for  certain  well-behaved  pans  of  the  earth’s  interior.  Further,  Keilis-Borok  and  his 
colleagues  hw'.  suggested  a nonlinear  anal’ tic  technique  that  can  be  extended  to  place  absolute  bounds  on  the 
range  of  possible  models  that  can  be  obtained  f-  m body-wave  data.  Thus,  whereas  MCI  may  have  become 
obsolete  for  kn  siting  body-wa  - 'JED,  it  .nc  . -he  only  practical  approach  to  inverting  surface  wave  GED. 
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DISCUSSION 


Sabatier:  Could  you  give  computer  time  examples  for  several  methods,  say,  Monte  Carlo,  hedgehog  methods.  <.-c.. 

in  seismology? 

Mggins:  l can  only  give  a qualitative  answer.  The  hedgehog  method  is  an  approach  to  Monte-Carlo  espoused  by 

Keilis-Borok  and  Knopoff.  You  only  use  a random  search  unt'l  you  find  a particular  model  that  passes  and 
then  make  the  values  of  the  parameters  discrete  and  investigate  the  space  of  all  possible  models  that  are 
nearest  neighbors  to  passing  models.  As  regards  time  for  inverting,  the  hedgehog  or  straight  Monte  Carlo  are 
very  expensive  Compared  to  analytically  computing  the  range  of  possible  models  its  an  impossible  compari- 
son. To  get  the  envelope  takes  a trivial  time. 

Moses:  The  use  of  the  amplitudes  of  the  waves  suggests,  from  electromagnetic  analogs,  that  you  can  go  beyond 

turning  points.  Is  this  trie? 


i* 


Mgg’iis:  That’i  right.  The  Abel  equation  approach  to  inverting  has  been  extended  by  Gerrer  and  Markushevich 

to  include  low  -.elocity  cones.  We  do  not  need  to  deal  with  models  that  are  monotonically  increasing.  There 
is  a certain  ambiguity  in  how  you  construct  these  zones,  however. 


Gross:  The  travel  time  against  distrance  data  is  obtained  from  various  stations  that  are  distributed  in  some  fashion 

and  may  have  entirely  different  aspects  relative  to  the  epicenter.  From  one  earthquake  to  another  you  may 
have  waves  arriving  from  different  directions  at  each  individual  station  Yet  you  interpret  them  as  if  they  are 
all  on  a line.  Is  that  right? 

Wiggins:  We  make  the  assumption  that  the  earth  is  radially  symmetric  and  then  subject  the  observations  to  a 

number  of  incomplete  corrections  which  we  think  we  know. 

Gross:  How  -sensitive  are  the  bounds  on  the  model  to  the  radial  resolution  that  you  allow? 

higgins  Qualitatively,  the  widest  bounds  are  those  for  models  that  ace  cassinated  (stairstep).  When  we  force  the 
model  to  have  smoothly  varying  velocities,  we  have  a reduction  of  maybe  two  in  the  total  area  of  that  space. 
However,  it  is  not  clear  from  physical  grounds  that  the  earth  has  to  have  a smoothly  changing  velocity. 
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BACKUS-GILBERT  INVERSION  OF 
TRAVEL  TIME  DATA 


Leonard  Evans  Johnson* 
University  of  California  at  San  Diego 


ABSTRACT  Jj  7 3 ' 1 I 6 1 8 

This  paper  describes  the  application  of  the  Backus-Gilbert  theory  for  geophysical  inverse  problems  to  the 
seismic  body  wave  travel-time  problem.  In  particular,  it  is  shown  how  to  generate  earth  models  that  fit  travel-time 
data  to  within  one  standard  error  and  having  generated  such  models  how  to  describe  their  degree  of  uniqueness.  An 
example  is  given  to  illustrate  the  process. 


INTRODUCTION 


In  this  paper  I will  describe  the  application  of  the  Backus-Gilbert  theory  for  geophysical  inverse  problems 
(1967, 1968, 1970]  to  the  seismic  body  wave  travel  time  problem.  Given  a set  of  measured  travel  times  for  the  real 
e?-th  from  a particular  source  to  various  surface  locations,  what  is  the  seismic  velocity  as  a function  of  depth? 

The  traditional  method  of  inverting  travel  time  data  is  due  to  Herglotz  and  Wiechert,  who  observed  that  in  the 
case  of  a spherical  earth,  with  the  velocity  v a function  of  the  radius  r alone,  if  v{r)/r  increases  with  depth,  the 
inversion  amounts  to  solving  a form  of  Abel’s  integral  equation.  An  elementary  solution  is  given  by  Jeffreys  [1962] : 


where  p - dT/dA,  the  derivative  of  the  travel  time  curve;  pt  is  the  value  of  p at  A = A|  and  is  also  equal  to 
rt  lv i for  the  seismic  ray  that  has  its  deepest  point  of  penetration  at  r, ; a = radius  of  the  earth.  Now  p is  a known 
function  of  A,  and  Eq.(l)  thus  determines  r,  corresponding  to  A]  and  hence  to  rj/v,  and  so  determines  v as  a 
function  of  r. 

The  Herglotz-Wiechert  method  is  subject  to  several  limitations.  First,  the  method  requires  that  p = dT/d  A be 
known  for  all  values  of  A from  0 to  Ai . This  requirement  cannot  be  met  in  the  presence  of  a shadow  zone  in  the 
interval  0 to  Ai  in  which  there  are  no  arrivals  of  the  particular  seismic  phase  being  considered.  Shadow  zones  are 
caused  by  regions  in  the  earth  where  the  velocity  decreases  with  depth  at  a rate  such  that  dv/dr  > v/r.  That  the  earth 
has  such  a region,  at  least  for  shear  waves,  was  convincingly  demonstrated  by  Dorman,  et  al.  [1960] . Another  diffi- 
culty with  the  requirement  of  a complete  travel  time  curve  out  to  the  distance  range  of  interest  is  that  there  may  be 
multiplicities  in  the  curve.  These  multiplicities  are  caused  by  regions  in  the  earth  where  the  velocity  increases  with 
depth  at  a rate  such  that  rays  penetrating  to  different  depths  are  refrecied  to  the  same  location  on  the  surface.  This 
is  not  a theoretical  difficulty  provided  that  all  the  later  arrivals  are  taken  into  account.  In  practice,  however,  the 
time  intervals  between  the  different  arrivals  at  a given  distance  in  the  presence  of  a multiplicity  in  the  travel  time 
curve  are  usually  very  short  and  cannot  be  satisfactorily  resolved.  Second,  since  the  slope  of  the  travel  time  curve  is 
required,  the  data  are  usually  smoothed  and  fitf  with  a curve,  which  is  then  numerically  differentiated.  This 

* Present  Address:  Cooperative  Institute  for  Research  in  Environmental  Studies,  University  of  Colorado, 
Boulder,  CO. 
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process  strains  the  data  in  that  a large  amount  of  data  closely  spaced  with  respect  to  epicentral  distance  is  required 
for  accurate  slope  information.  A more  fundamental  consequence  of  the  finite  amount  of  data  available  at  any  given 
time  is  that  it  leads  to  a gross  nonuniqueness  in  the  solution  to  the  problem.  Backus  and  Gilbert  [1967]  prove  that 
the  set  of  earth  models  which  exactly  fit  the  observed  data  is  either  empty  or  an  infirite  dimensional  manifold.  Third, 
the  method  does  not  take  into  account  observational  errors  in  the  data  and  the  effect  that  these  errors  may  have  on 
the  solution. 

The  Herglotz-Wiechert  method  is  valuable  in  the  sense  that,  in  the  absence  of  low-velocity  zones,  it  establishes 
the  existence  of  a solution  to  the  seismic  velocity  inverse  problem.  Moreover,  Gerverand  Mcrkushevich  [1966,  1967] 
have  proved  that  if  we  are  given  an  infinite  amount  of  perfectly  accurate  travel  time  data  at  all  distances  from  sources 
above  and  below  all  the  low  velocity  zones,  a solution  exists  and  is  unique  except  in  the  low  velocity  zones  themselves. 
It  is  always  comforting  to  know  that  a solution  to  an  idealized  version  of  one’s  problem  exists  and  is  unique. 

The  Backus-Gilbert  method  of  inverting  travel  time  data  is  not  subject  to  the  limitations  mentioned  above. 

There  is  no  need  of  a complete  travel  time  curve,  there  is  no  need  to  fit  the  data  with  a smooth  curve  before  invert- 
ing, and  the  Backus-Gilbert  method  takes  observational  errors  in  the  data  into  account.  The  method  exploits  the 
nonuniqueness  of  the  problem  to  generate  solutions  that  fit  the  data  and,  most  important,  it  allows  us  to  make  a 
quantitative  statement  about  the  degree  of  nonuniqueness  of  the  solutions. 


INVERSION  THEORY 


We  consider  spherically  symmetric  earth  models  in  which  the  velocity  is  only  a function  of  the  radius  r.  We 
may  think  of  earth  models  as  members  of  a Hilbert  space  in  which  the  inner  product  of  a pair  of  functions  m.m’ 

is  defined  as  (m.m’)  = m(r)m’(r)dr  and  a norm  as  Dm  I = (m.m')11.  To  apply  the  Backus-Gilbert  method 

we  must  find,  to  first  order,  the  change  in  the  travel  times  6 T produced  by  a small  change  in  the  velocity  6 v(r). 
Following  Backus  and  Gilbert  [1969] , the  travel  time  of  a group  of  nondispersive  body  waves  from  a source  p,  to 
a receiver  p2  is  given  by 


f 

Jd, 


rPi 

T = I v_1  ds 

hi 


(2) 


where  ds  is  the  element  of  path  length.  According  to  Fermat’s  principle,  to  first  order  in  6 v , we  have 


fPi 

Jp, 


5 7'=  — I v 1 8v  ds 

hi 


(3) 


If  we  introduce  spherical  polar  coordinates,  radius  r,  colatitude  8 , and  longitude  X ; choose  units  so  that  the  radius 
of  the  earth  is  at  r = 1 ; and  consider  the  ray  path  from  r = 1,  8 = 0 to  r = 1 , 8 = A,  X - 0,  we  have  \Bullen,  1963] 


7-  2 | 

rP 


ri(r)v(r)~ 


[tko1  v]",/2 


5v(r)  dr 


(4) 
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where  rp  is  the  minimum  radius  reached  by  the  ray,  r)(r)  = rv(r)_I , and  p ~ v~xr  sin  / is  called  the  ray  parameter, 
which,  according  to  Snell's  law,  is  constant  along  the  ray.  The  angle  i is  the  angle  between  the  ray  and  the  local  radius 
vector.  At  r * rp,  i * 90°  and  p = rplvp  where  vp  - v(rp). 

We  can  regard  7/,  the  travel  time  calculated  from  a given  model  v(r)  at  epicentral  distance  A,,  r = 1, ...  ,1V, 
as  a nonlinear  functional  of  that  model.  To  linearize  the  problem,  we  appeal  to  Frechet  differentiability  of  the  func- 
tional 7}.  Then,  correct  to  first  order  in  the  small  quantity  6v(r),  6 7}  is  a well-defined  linear  functional  and  we 
write  Eq.  (4)  as 


A/(r)6v(r)  dr  (5) 


where  A/(r)  is  the  fth  data  kerne1  or  so-called  “Frechet  kernel”  and  is  defined  as 

A/(r)  * -2p(r)v(r)-1  |rj(r)J  - p2]  , rpl<r<\ 

- 0 0 <r<rp‘  (6) 

in  the  nonlinear  case,  the  data  kernels  are  different  for  different  Earth  models  v(r). 

It  is  often  preferable  to  consider  a relative  perturbation  to  the  model  rather  than  an  absolute  one.  If  we  define 
Gj (r)  * v(r)Kj(r)  and  m(r)  * 6v(r)/v(r),  we  can  write  Eq.  (S)  as 

67}  = f Gj(r)m(r)  dr  (7) 


and  G,(r)  is  now  our  data  kernel  for  the  model  V(r)  = lnv(r),  i.e.,  m(r)  = 6 V(r)  = 6v(r)/i>(r). 

Now,  the  Backus-Gilbert  method  requiies  the  data  kernels  to  be  square  integrabie,  and  it  is  obvious  from  Eq. 
(6)  that  this  is  not  the  case  for  G,-(r).  To  remedy  this  situation,  we  integrate  Eq.  (7)  by  parts  once  to  obtain 


6 7}  = m(l)G/(r)  - 


j4  §(r)m'(r)  dr 


(8) 


where  Gf(r)  »1  G,(r)  dr  and  m'(r)  = dm(r)/dr. 

Jo 

Associated  with  each  travel  time  7}  Is  an  observational  error  that  we  assume  has  zero  mean  and  variance  o* 
If  we  consider  6 7}  as  the  difference  between  the  observed  travel  time  for  the  real  earth  t Oj  and  the  travel 
time  calculated  for  a particular  model  Tf^ . and  define  y^  = 7}°^*  - 7}“*  we  can  write  Eq.  (8)  as 


ly  - Oj  <i?f(l  )<?/(!)  - 


f 


Gi(r)m'(r)  dr<7,  + of 


(9) 
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Our  objective  now  is  to  determine  m(r)  from  Eq.  (9)  given  a finite  number  of  known  7 ■ ± o-,  / 1,  - . . A'. 

Formulated  in  this  way,  the  problem  is  ill-posed  in  the  sense  that  the  solution  is  not  unique.  We  can  take  advantage 
of  this  nonuniqueness  if  we  are  willing  to  accept  any  m{r)  that  is  a solution  to  Eq.  (9).  A method  proposed  by 
Backus  and  Gilbert  [1967J  for  choosing  a solution  is  to  find  the  model  that  satisfies  Eq.  (9)  and  is  closest  in  the 
least-squares  sense  to  a starting  model.  In  particular,  we  seek  to  minimize 


1 

1 


dr  + ^ m(l)J 


(10) 


subject  to  the  constraints  Eq.  (9).  This  is  a classic  problem  in  the  calculus  of  variations,  and  if  we  introduce  Lagrange 
multipliers  Vj  and  carry  out  the  minimization  we  have  the  solution 

m '(r)  = - Vj  Gj(r) 

i 

m(l)  = Y Vj  Gj(  1 ) (II) 

/ 


To  determine  the  V-  we  substitute  Eq.  (1 1)  into  (9)  to  obtain 

Ti 


,-a<  W:  G,(r)  G,(r)  dr  + G/(l)  G;(1)J  Vj < 7/  + 


(12) 


We  define  the  symmetric  inner  product  matrix 


(r)Gj(r)  dr  + Gf-(  1 ) G,(  1 ) 


(13) 


and  write  Eq.  (12)  in  matrix  form  as 


y-<j<  a -v  <y+  a 


(14) 


We  notice  that  the  system  of  Eq.  (1 2,  or  (14)  would  be  much  simpler  to  solve  if  the  G;(r)  were  orthogonal.  Then, 
the  matrix  A would  be  diagonal  and  Eq.  ( 14)  would  be  a system  of  linear  algebraic  equations,  one  for  each  Vj.  Since  A 
is  symmetric  and  positive  definite  there  is  a linear  transformation  that  diagonalizes  A and  simultaneously  diagonalizes 
the  covariance  matrix  £••  of  the  observed  data  [ Gilbert , 1971  ] . Usually,  we  assume  the  data  have  independent  stan- 
dard errors  </j  so  that  Ejj  is  already  diagonal,  but  this  may  not  be  the  case.  Following  Gilbert  11971 J,  we  let  T be 
the  matrix  of  the  diagonalizing  transfoimation  and  the  transformed  version  of  Eq.  (14)  becomes 

(T-y < T- V<(l-y  +A-1'2)  (15) 


where  A-1  is  the  transformed  diagonal  covariance  matrix. 
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When  the  data  kernels  are  orthogonal,  we  have  from  Eqs.  (10)  and  (i  l) 

- I m'(ry  ar  * 1 -m1  = Y (T  • V))  (16) 

2J0  2 

/ 

Thus,  the  minimum  in  Eq.  (16)  occurs  when  each  (T  * V\  in  Eq.  (15)  is  as  small  as  possible.  We  write  the 
expression  t = (T  •?- A-I/I),  u - (T*y  + A'1'2),  C = T'U  and  we  have 

®j  <ci<ui  i = \, . . . ,N  (17) 

When  8,-  >0  we  take  q = 8,-,  when  «,  < 0 we  take  q = u,  and  q = 0,  otherwise,  Gilbert  [1971]  shows 
that  the  element'  of  the  transformed  covariance  matrix  E are  the  inverse  eigenvalues  of  the  matrr  - a~l  Ana- 
Thus,  the  transformed  data  can  be  ranked  according  to  standard  error;  that  is,  large  eigenvalues  of  a correspond  to 
transformed  data  with  small  standard  error.  This  allows  us  to  keep  track  of  the  growth  of  the  perturbation  to  the 
model  in  Eq.  (16)  and  to  reject  those  transformed  data  which  have  standard  errors  that  are  larger  than  we  wish  to 
consider. 

Once  we  have  found  the  v,’s  from  Eq.  (17)  we  calculate  m '(r)  from  Eq.  (1 1)  and  integrate  once  to  get 
m(r)  = 6v(r)/v(r).  This  integration  produces  a very  smooth  perturbation  Sv(r)  to  the  model.  We  new  have  a new 
model  v(r)  + 6v(r)  from  which  we  can  calculate  new  travel  times  to  compare  with  the  observed  travel  times.  Since 
we  have  linearized  the  problem,  the  new  calculated  travel  times  may  not  agree  with  the  observed  travel  times  to  with- 
in one  standard  error,  so  we  repeat  the  process  outlined  above.  The  process  should  converge  provided  the  initial 
values  of  y(  = TPbs  - Tf*1  are  not  too  large. 


RESOLVING  POWER  THEORY 


Now  that  we  have  a method  for  generating  models  that  fit  the  observed  data,  what  can  we  say  about  their 
uniqueness?  When  we  have  onl>  a finite  amount  uf  data  we  cannot  expect  to  resolve  details  of  arbitrarily  snail 
scale.  The  best  we  can  hope  for  is  that  our  data  provide  us  with  an  estimate  of  the  true  value  of  the  mode),  at  any 
particular  radius  that  is  in  some  sense  a smoothed  or  averaged  version  of  the  structure  around  that  radius. 

Neglecting  errors  in  the  data  for  the  moment  we  have 


7r  * | Gj(r)m(r)dr 
o 


08) 


If  we  wish  to  consider  linear  averages  of  m(r),  we  have 


i 


m(r)  dr 


(19) 


We  would  like  to  choose  the  constants  q in  Eq.  (19)  so  that  the  function  in  square  brackets  is  localized  around  some 
value  of  the  radius,  say  r0.  In  other  words,  if  we  define 
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(20) 


A(r,  r0) 


)G,-(r) 


I 


we  would  like  A(r,  r0)  to  resemble  a Dirac  de'ta  function  centered  on  r0  ; if  this  were  possible,  Eq.  (19)  would 
simply  give  m(r0 ).  the  exact  value  of  m(r)  at  r0 . As  mentioned  above,  it  is  impossible  to  construct  a perfect 
delta  function  with  nly  a finite  amount  of  data,  but  by  choosing  the  a,-  carefully  we  may  be  able  to  do  a good  job 
of  approximating  a delta  function  at  ra . Then  Eq.  (19)  would  give  us 


i 


J A (r,  r0  )m  (r)  u. 
a 


Cm.  A> 


(21) 


where  < n:,  A > is  a smoothed  or  averaged  '"•-ion  of  m (r)  around  r0 . The  averaging  length  would  correspond  to 
the  width  of  A (r,  r0 ) at  r0 . 


How  can  we  choose  the  a,  so  that  A{r , rn)  is  an  approximation  to  a delta  function?  Backus  and  Gilbert 
h"ve  investigated  several  so-called  “6-ness  criteria,”  which  are  numerical  measures  of  the  difference  between  A(r,r0) 
and  a delta  tuncuor..  One  such  measure  is 


d = 


[s 


(r  - r0) 


dr 


with  the  condition  that 


(22) 


(23) 


The  dj  are  found  by  minimizing  Eq.  (22)  subject  to  the  condition  (23) 

The  criterion  in  Eq.  (22)  is  not  the  most  appropriate  one  to  use  in  the  travel  time  problem  because  the  data 
kernels  G,-(r)  in  the  expression  (20)  are  not  square  integrable.  However,  the 

G,(r)  = ^ G,(r)  dr 


are  square  integrable  and  so  an  obvious  extension  of  Eq.  (22)  is 


f //(r  - r0)  - f 
j L 


dr 


A = 12  | | H(r  - r0)  - 1 A{r,  r0)  drj 

where  H(r  - r0)  is  the  unit  step  function.  Suppose  A (r,  r0 ) were  the  “box  car”  function-that  is, 


A(r,  r0)  * 1/8 
= 0 


lr  - r0  I < 8/2 
lr  — r0  i > 8/2 


(24) 
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then  4 = B.  Thus,  we  see  that  A is  a measure  of  the  width  of  our  averaging  function  A{r,  r0),  which  we  will  call 
the  “spread.”  The  criterion  Eq.  (24)  is  an  example  of  a linear  “quelling”  of  a discontinuous  linear  functional 
described  by  Backus  [1970] . Once  we  have  found  the  constants  from  the  minimization  of  Eq.  (24)  subject  to 
the  condition  Eq.  (23)  we  can  calculate  the  smoothed  or  averaged  version  of  m(r)  from  Eq.  (21).  if  m(r)  has  any 
fine-scale  structural  detail  with  wavelengths  smaller  than  our  measure  4 of  the  spread  of  the  averaging  function 
A(r,  r0),  we  will  not  be  able  to  resolve  it.  This  is  an  inevitable  consequence  of  the  finite  amount  of  data  we  have 
available  and  the  spread  4 is  a quantitative  measure  of  the  uniqueness  of  our  inversion. 

So  far,  in  discussing  resolving  power,  we  have  assumed  perfectly  accurate  data.  What  effect  do  the  observational 
errors  in  the  data  have  on  the  resolving  power?  If  the  data  have  observational  errors  A 7),  then  it  is  easy  to  see  from 
Eq.  (20)  that  these  errors  produce  an  error  A <m,  A > in  our  averaged  value  of  m(r)  at  r0  given  by 


A < m,  A > 


(25) 


It  is  clear  that  a highly  localized  average  of  m(r ) at  r0  is  not  very  useful  if  the  associated  error  in  tne  average  in 
Eq.  (25)  is  large.  We  would  be  willing  to  accept  an  average  with  a slightly  larger  spread  if  we  could  reduce  the 
error  in  Eq.  (25)  appreciably. 

Now  we  don’t  know  the  A 7)  exactly,  but  we  assume  we  know  something  about  their  statistics.  In  particular, 
we  assume  they  have  zero  mean  and  finite  variance  - that  is,  we  assume  that  the  covariance  matrix 


fy-EjAJiAT)} 


exists  where  E|  • } means  expected  value.  We  take  the  square  root  of  the  variance  of  A <m,A  > as  an  estimate  of 
the  error  we  make  in  calculating  < m,  A > with  erroneous  data,  call  it  e . ”’e  nave 


i i 


(26) 


or  in  linear  operator  notation 


e*  ■ a • E • a 


(27) 


For  a precise  definition  of  the  notation  see  Backus  and  Gilbert  [1970] . 

Similarly,  after  some  algebraic  manipulation,  we  can  write  Eq.  (24)  si 

4»a*S,a-2r*a  + Jk  (28) 

where  ^ 

Sy  • J G<(r)  G/(r)  dr 

#f‘ 

T/  - J H(r-r9)Gf(r ) dr  (29) 

o 

k ■ 1 - r0 
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and  the  condition  (23)  as 


a • G = 1 


(30) 


Our  object  is  now  to  find  the  a that  minimize  both  the  spread  given  by  Eq.  (28)  and  the  error  in  the  average  of  m(r) 
given  by  Eq.  (27)  while  satisfying  the  condition  (30).  Clearly,  we  cannot  minimize  Eqs.  (28)  and  (27)  with  the  same 
set  of  a but  Backus  and  Gilbert  [1970]  show  that  we  can  minimize  a linear  combination  of  the  two.  Consider  the 
combination 


q - 4 cos  0 + e2  sin  0 


(31) 


where  0 is  a parameter  that  runs  from  0 to  ir/2.  When  0 = 0 we  are  minimizing  spread  and  when  0 = it  12  we 
are  minimizing  the  error  hi  the  average  of  m(r).  Backus  and  Gilbert  [1970]  prove  that  as  0 goes  from  0 to  rr/2 
the  curve  e1  (4),  called  the  tradeoff  curve  of  error  versus  spread,  is  a monotonically  decreasing  function  of  4 ; that 
is,  we  can  lower  the  error  in  the  estimate  of  <m.  A > by  willingness  to  accept  a larger  cpicad. 

Minimizing  Eq.  (31)  leads  to  a system  of  equations  for  the  a of  the  form 


Q • a = 1 2 T cos  0 


(32) 


where 


Qy  = 12Sjycos0  + Ey  sin  0 (33) 

There  will  be  a different  set  of  a for  each  value  of  0 on  the  tradeoff  curve.  Once  again,  the  system  Eq.  (32)  would 
be  much  simpler  to  solve  if  Q were  a diagonal  matrix,;  and  once  again  we  cm  use  the  diagonalizing  transformation 
developed  by  Gilbert  [1971]  to  simultaneously  diagonalize  S and  E. 

To  summarize  briefly,  at  each  radius  r0  where  we  wish  to  calculate  <m,  A > we  minimize  Eq.  (31)  subject 
to  the  condition  (30)  to  determine  the  a’$  at  each  point  on  the  tradeoff  curve.  At  each  point  on  the  tradeoff  curve 
at  r0  , we  calculate  the  spread  4 by  Eq.  (28)  and  the  error  in  <m,  A > at  r0  by  Eq.  (27). 


RESULTS 


Most  of  the  results  of  the  inversion  and  resolving  power  calculations  using  body  wave  travel  time  data  will  be 
published  elsewhere.  However,  one  example  is  given  to  illustrate  the  method. 

Figure  1 shows  a starting  model  for  the  inversion  of  a set  of  data  consisting  of  70  P-w»ve  travel  times  from 
25°  - (1°)  - 94°  taken  from  Evemden  and  Clark  [1970] , and  30  PCP  travel  times  from  1 5°  - (2°)  - 73°  taken  from 
Herrin  et  al.,  [1968], 

Figure  2 shows  the  result  of  the  fifth  iterate  in  the  inversion  process  that  fits  the  data  to  within  one  standard 
error  (standard  errors  of  0.S  sec  were  taken  for  the  P and  PCP  data).  Evemden  and  Clark’s  travel  time  curve  (Ec  Sl-1 
in  their  paper)  consists  of  a series  of  matched  straight  line  segments.  As  a result,  there  is  considerable  detail  in  the 
final  model  which  fits  u •$  data.  The  perturbations  in  velocity  evident  in  figure  2 between  r/a  = .55  and  rla  ~ .85 
are  on  th*  order  of  2%  over  distances  of  100-200  km  or  greater.  The  question  now  is,  are  these  details  resolvable? 
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Figure  3 is  the  resolving  power  map  for  this  model,  which  shows  the  contours  of  spread  or  distance  over  which 
we  must  average  the  model  to  achieve  a desired  level  of  relative  error  in  the  estimate  of  the  velocity  at  any  given 
radius.  The  values  of  spread  are  given  in  fractions  of  an  earth  radius  For  example,  we  wish  to  know  if  the  detail  in 
figure  2 is  resolvable.  Since  these  velocity  jumps  are  on  the  order  of  2%  we  follow  a horizontal  line  across  figure  3 
at  the  1%  level  to  see  over  how  much  distance  the  velocity  must  be  averaged  to  achieve  such  an  accuracy.  As  we  see 
from  the  figure,  from  r/a  * 0.55  to  r!a  * 0.85  the  spreads  range  from  40  km.  to  about  130  km.  As  we  near  the 
surface,  the  spreads  get  much  larger,  approaching  640  km.  This  is  due  to  the  fact  that  only  P data  from  25 ’ on  out 
was  used,  and  these  rays  only  begin  to  bottom  at  about  460  km.  depth  in  this  model.  So,  if  we  average  the  model  in 
figure  2 with  the  spreads  we  obtain  between  r/a  * 0.55  and  r/a  * 0.85  in  figure  3 at  the  1%  level,  the  detail  will 
not  be  lost;  we  say  that  it  is  resolvable. 

Note  that  we  have  not  made  a statement  about  the  real  earth  yet.  Backus  and  Gilbert  [1968, 1970]  show  that 
only  if  the  real  earth  is  linearly  close  to  our  model,  in  ;he  sense  in  which  we  have  linearized  the  problem,  will  it  have 
the  same  linear  averages  as  out  model. 


Figure  1.-  The  tuning  model  for  P-velodty  in  the  mantle, 
a ■ radius  of  the  Earth. 


Figure  1-  The  fifth  iterau  in  the  inversion  process  obtained 
from  figure  1. 


Figure  4 is  a picture  of  the  step  functions  con- 
structed from  linear  combinations  of  data  kernels  u ing 
the  criterion  in  Eq.  (24).  The  step  functions  are  pictured  at 
various  points  from  0 = 0 to  0 - tr/2  along  the  tradeoff 
curve  at  r/a  * 0.7  for  the  model  shown  in  figure  2. 

Recall  that  near  0 = 0 we  are  minimizing  spread  so  the 
step  functions  are  very  steep  and  the  resulting  “delta 
functions”  would  be  very  narrow.  Near  0 * rr/ 2 we  are 
minimizing  error  so  that  the  step  functions  are  more 
gradual  and  the  resulting  “delta  functions”  are  wider; 
that  is,  they  have  more  spread. 


Figure  3.-  A contour  map  of  the  spread  of  the  optimal 
avenging  function  A (r.  r,)  as  a function  of  r,  and  the 
relative  error  in  < m,  A >.  Dm  contours  of  spread  are 
liven  in  fractions  of  an  earth  radius. 
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figure  4.  Step-functions  constructed  from  linear  combinations 
of  data  kernels  using  the  criterion  in  Eq.  (24).  The  step-functions 
are  pictured  at  various  points  from  9=0  to  9 - nil  along 
the  tradeoff  curve  at  r„  la  - 0.7  for  the  model  shown  in 
figure  2. 
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DISCUSSION 


Parker:  The  method  you  are  using,  because  you’ve  linearized  and  made  various  approximations,  doesn’t  depend 

strictly  on  the  data  available.  You  can  map  the  data  in  any  way  you  like  that  is  more  convenient,  as  you’ve 
done.  You  can  map  it  into  a more  convenient  form  to  avoid  triplications;  for  example,  you  can  exploit  the 
fact  that  you  kne  that  the  curve  of  the  matched  data  is  piecewise  continuous.  Thus,  you  incorpo.ute 
assumptions  and  remove  the  difficulties  of  interpretation.  Having  a general  method  available  means  you  aren't 
dependent  on  the  type  of  data  available.  You  can  map  it,  always  under  the  assumption  that  linearity  is  a valid 
assumption. 
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ABSTRACT 


The  theory  of  Backus  and  Gilbert  g.  \ a technique  for  solving  the  general  linear  inverse  problem.  Observa- 
tional error  and  lack  of  data  are  shown  to  reduce  the  reliability  of  the  soiution  in  different  ways:  the  former 
introduces  statistical  uncertainties  in  the  model,  while  the  latter  smooths  out  the  detail.  Precision  can  be  improved 
by  sacrificing  resolving  power,  and  vice  versa,  so  that  ;ome  compromise  may  be  made  between  the  two  in  choc  ing 
the  best  model.  Nonlinear  inverse  problems  can  be  brought  into  the  domain  of  the  theory  by  linearizing  about  a 
typical  solution. 


The  inverse  problem  of  electrical  conductivity  in  the  mantle  is  used  to  illustrate  the  Backus-Gilbert  technique; 
an  example  of  the  tiadeeff  diagram  is  given. 


INTRODUCTION 


The  most  general  theory  for  handling  inverse  problems  that  we  have  today  is  that  of  Backus  and  Gilbert  [Backus 
and  Gilbert  1967, 1968, 1970;  Backus  1970a, 6, c] . This  work  has  gained  the  reputation  of  being  mathematically 
very  difficult.  However,  it  is  not  the  basic  concepts  that  are  difficult.  Rather,  to  make  their  exposition  rigorous  and 
general,  the  original  authors  needed  powerful  mathematical  weapons  that  are  not  in  the  armory  of  the  average  geo- 
physicist. The  situation  is  analogous  to  that  of  quantum  mechanics;  Von  Neumann  rigorized  the  subject  by  usi  g 
ihe  theory  of  unbounded  operators  and  spectre!  theory  in  Hilbert  space,  but  undergraduates  grasp  the  basic  ideas  of 
quantum  mechanics  without  this  apparatus.  Backus  ai:d  Gilbert  also  use  Hilbert  space  u»  develop  their  theory,  but 
we  shall  nut  need  any  such  profound  mathematics  here. 

What  is  an  inverse  problem?  Suppose  we  have  made  measurements  of  the  magnetic  field  of  the  earth.  What 
does  this  tell  us  about  the  magnetization  of  surface  rocks?  Or  given  a set  of  normal  mode  frequencies  (the  frequencies 
of  free  oscillation  of  the  earth  observed  after  the  largest  earthquakes),  we  would  like  to  find  the  density  and  seismic 
velocities  as  a function  of  radius.  These  are  typical  geophysical  inverse  problems;  they  are  attempts  to  infer  properties 
of  the  interior  of  the  earth,  which  is  inaccessible  to  us,  from  measurements  made  at  the  surface.  The  general  procedure 
in  such  problems  is  to  construct  a model  of  the  earth  that  possesses  the  same  external  observables  as  those  measured 
in  practice.  With  modern  computers  this  is  no  longer  a difficult  task.  The  real  difficulty  is  in  knowmg  how  well  the 
data  determine  the  property  we  are  seeking.  For  example,  it  is  well  known  that  measurements  of  the  external  gravity 
field  of  the  earth  do  not  determine  uniquely  the  density  within  it:  there  are  infinitely  many  structures,  widely  differ- 
ing from  each  other,  which  all  give  rise  to  the  same  field.  This  is  the  sort  of  problem  that  Backus  and  Gilbert  have 
solved  and,  what  is  more,  to  which  they  have  brought  a great  deal  of  insight. 

THE  BACKUS-GILBERT  TECHNIQUE 
The  Data 

We  first  define  the  observations  to  be  the  N real  numbers  jj  ~ 1 , 2, , ...  A,  where  A'  is  finite.  Clearly,  in 
the  normal  mode  example,  we  will  always  have  a limited  number  of  measurements.  But  what  about  the  case  of 
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magnetic  data  recorded  on  a strip  chart?  There  we  have  a continuous  function.  How  can  we  pretend  that  an  adequate 
representation  is  obtained  by  N numbers?  There  are  two  ways  to  answer  this  objection.  The  first  is  the  frivolous 
suggestion  that  since  we  now  always  process  our  observations  on  a computer,  which  cannot  hold  an  infinite  number 
of  samples,  we  must  be  satisfied  with  something  less.  This  suggestion  dodges  the  issue  because  we  have  appealed  only 
to  the  practice!  nature  of  data  processing,  one  might  ask  whether  continuous  records  siu/uld  be  digitized  so  closely 
that  the  number  of  samples  is  as  large  as  the  available  computer  memory  permits. 

It  is  more  pertinent ' o appeal  to  the  practical  nature  of  measurement.  All  recording  instruments  have  a limited 
bandwidth,  and  all  records  are  finite  in  length.  It  is  clear  that  our  continuous  chart  recording  could  be  Fourier  ana- 
lyzed (there  is  no  problem  about  existence  of  the  Fourier  series  here),  and  coefficients  corresponding  to  frequencies 
higher  than  the  instrument  cutoff  must  be  useless.  Thus,  a finite  number  of  numbers  ( the  N accepted  Fourier 
coefficients)  represent  the  original  continuous  curve  adequately  and  can  be  used  as  the  data  y j.  In  time-series  ana- 
lysis, this  is  called  choosing  the  proper  Nyquist  frequency  to  sample  all  the  information. 


DELTA  FUNCTIONS 


Suppose  we  now  have  our  data,  and  for  the  present  assume  they  are  exact-no  experimental  error.  Let  us  take 
the  problem  of  determining  a property  in  the  earth  as  a function  of  radius,  assuming  no  angular  dependence.  Further- 
more. we  initially  consider  a linear  inverse  problem;  this  means  the  observable  depends  in  a linear  way  on  the  property. 
Under  these  assumptions  we  can  write 

a 

7/  = f m (r)  Gj{r)  dr  j - l,  2 N 


where  m(r)  is  the  property  and  Gy(r)  is  called  a data  kernel,  one  for  each  observable.  A simple  but  instructive 
example  is  to  make  m(r)  the  density  and  yj  the  radial  component  of  gravity  at  radius  Rj  from  the  center  of  the 
earth  (Rj>a).  Then 


f 4 wGr*  , * J 

yi  - [ ~rF  f(r>  ir 


where  G is  Newton’s  gravitational  constant.  We  see  at  once 


„ _ 4nGrJ 

’’ " ~*r 


and  the  problem  is,  of  course,  linear. 

Now  let  us  return  to  the  more  general  radial  property  m{r).  The  only  thing  we  know  about  m is  the  set  of 
measurements  yj.  How  can  we  localize  this  information  to  points  within  the  earth?  Backus  and  Gilbert  make  the 
following  suggestion;  consider  a linear  combination  of  yj  given  by 
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L = 


A' 

2 r, 

/=1 


From  our  linearity  assumption  we  have 


- f [i  w 


m(r)  dr 


Now  suppose  we  could  choose  the  constants  ay  so  that  the  function  in  square  brackets  was  a Dirac  delta  function 
centered  on  r0  . Then  L would  simply  be  m{r0).  the  property  we  want  at  the  position  r0  . In  general,  it  is 
impossible  to  get  ideal  delta  functions,  but  by  choosing  our  coefficients  ay  carefully  we  might  be  able  to  find  a 
function  that  is  concentrated  strongly  at  r0  . How  do  we  choose  ay  to  provide  approximations  to  a delta  function 
at  various  radii  ? Backus  and  Gilbert  define  a numerical  measure  of  “difference  from  a delta  function"  and  mmimi/.c 
this  number  by  varying  the  coefficients  ay.  For  example,  they  investigated  these  two  measures: 


D,  = 


Dr  = 


a 

| |6(r  - r0)  - f(r)J  dr 
0 

a 

\ F(rf  (r  - r„)>  J, 


r 

with  I F(r)  dr  = 1.  The  measure  £>,  is  essentially  a mean  square  deviation,  the  other  is  a more  sophisticated 

Jo 

criterion,  but  I think  one  can  see  that  if  or  D2  can  be  made  sma!1,  the  function  F(r)  will  have  a large  peak  at 
r0  and  unit  area  under  it.  Other  criteria  are  possible  but  the  two  above  have  proved  useful  because  they  yield  simple 
equations  for  the  coefficients  ay. 

Having  decided  on  a criterion  and  performed  the  minimization  we  will  obtain  for  every  radius  r0  a set  of  ay 
and  corresponding  to  it  an  estimate  of  the  property  m(r0)  given  by 


N 


<m(r0)> 


V 


So  the  information  we  gain  about  the  property  m is  a smoothed  version  of  the  actual  structure;  the  real  profile  has 
been  convolved  with  the  best  “delta  function”  we  can  find  from  linear  combination  of  the  Gy.  if  there  is  detailed 
structure  that  has  wave  ’engths  less  than  the  width  of  our  “delta  function”  at  a particular  depth,  our  measurements 
cannot  reveal  it.  In  other  words  the  fin'te  number  of  data  (perfectly  accurate  tho"°h  they  are)  give  us  a blurred 
picture  of  the  earth.  We  have  thus  discovered  the  extent  to  which  our  inversion  is  unique. 
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Returning  now  to  the  example  of  gravity,  we  can 
see  that  measurements  at  different  /?.-  yield  no  new 
information  about  p(r),  because  any  linear  combina- 
tion of  the  Gj  is  still  proportional  to  r1 * 3 . This  is  the 
best  (and  only!)  approximation  to  a delta  function  that 
we  can  make.  Thus,  rather  trivially,  we  have  confirmed 
the  statement  made  earlier:  any  radially  structured 
model  with  the  correct  mass  satisfies  the  data.  Suppose 
now  we  include  the  moment  of  inertia  as  a datum  72  , 
while  y,  is  a single  gravity  measurement: 


i 8tr  * 

7i  = 1 — r*  p(r)dr 

Jo  J 

so  this  datum  is  linear  in  p . Without  going  into  details 
we  can  see  that  linear  combination  of  the  two  data  ker- 
nels will  allow  us  some  sort  of  approximation  to  a peaked 
function.  Using  the  Z),  criterion  1 calculated  some 
curves  (see  fig.  1).  We  are  not  getting  very  good  resolu- 
tion with  these  data.  If  experiments  were  available  that 
gave  data  whose  kernels  were  proportional  to  r6 , r* , 
r10 , etc.,  it  can  be  proved  that  arbitrarily  good  delta 
functions  can  be  constructed  from  them  by  means  of 
the  Weierstrass  approximation  theorem  [Counmt  and 
Hilbert,  1966] . Unfortunately  these  data  do  not  exist! 

The  Role  of  Experimental  Error 


So  far  all  our  data  have  been  precise:  we  have  seen 
that  with  only  a finite  number  of  measurements  our 
knowledge  must  be  imperfect  even  in  this  ideal  situation. 

In  practice,  each  of  the  measurements  will  be  subject  to 
error,  whose  rough  size  the  experimenter  usually  estimates. 
The  property  averages  <m(r0)  > are  simply  linear  combinations  of  the  data  7.-  so  that  it  is  easy  to  find  the  associ- 
, ated  errors  of  their  values.  Now  we  have  two  measures  of  imprecision  both  of  wnich  we  should  like  to  make  as  small 
as  possible- the  error  of  the  estimate  and  its  resolution  (defined  to  be  the  half  width  of  the  delta  function  for 
example).  Can  the  ay  be  chosen  to  minimize  both  at  once?  In  fact  this  is  impossible:  some  compromise  must  be 
accepted.  Let  us  use  the  D2  criterion  for  delta  function  quality;  it  is  possib!;  to  define  the  resolution  width  s to 
be  equal  to  i2£>2  (this  has  dimensions  of  length).  After  some  manipulation  one  can  show  that 


Figure  1.-  Three  delta  functions  for  density  using  gravity 
and  moment  of  inertia. 


1 1 


where  5,y  is  a matrix  that  depends  on  the  Gj  and  r0 . Similarly,  the  error  estimate  e of  <m(r0)  > is  given  by  a 

quadratic  form 


e 


? _ 


/ 


7-24 


where  ry  [ins  standard  error  Oy  (it  is  assumed  here  that  the  errors  are  statistically  independent).  Since  we 
cannot  minimize  e2  and  s with  one  set  of  ay,  let  us  agree  to  minimize  instead  a composite  quadratic  form 

Q(aj)  = (1  - w)s  + we2 

whi  * e ' ’ ,s  a weighting  factor  that,  when  0,  gives  all  the  weight  to  resolution  (and.  of  course,  yields  the  solu- 
tior  we  . .cussed  earlier),  and  when  I.  gives  all  the  we.ght  to  error.  Backus  and  Gilbert  [1970)  show  that  as  «■ 
vanes  from  0 to  1,  and  we  minimize  Q by  choosing  the  appropriate  set  of  ay.  the  resolution  length  increases 
and  he  e.  ror  decreases.  In  other  words,  one  can  improve  the  error  estimate  of  a property  average  by  sacrificing 
ns  resolving  power  and  vice  versa.  The  “best"  choice  of  w is  impossible  to  give:  it  depends  on  what  one  is 
tiymg  to  lay  about  the  earth.  Therefore,  to  solve  a real  problem  completely  one  must  give  the  relationship 
between  C'or  and  resolution  at  every  radius. 

The  ituation  outlined  above  is  analogous  to  that  described  by  the  quantum  mechanical  uncertainty  princi- 
ple; it  is  i .possible  to  giu  a perfectly  accurate  property  value  at  a perfectly  defined  position.  We  can  improve 
o le  at  the  ex  per  se  of  the  other,  but  in  the  case  of  linear  inverse  theory  neither  the  accuracy  nor  the  resolution 
can  be  refined  indefinite!) : the  inadequacy  of  our  data  puts  bounds  on  both.  We  shall  look  at  an  example  of  a 
"tr  ideoff*  diagram  in  discussing  the  electrical  conductivity  problem. 

Nonlinear  Extension 

Most  (but  not  all)  interesting  geophysical  inverse  problems  are  nonlinear.  The  observable  data  are  nonlinear 
functi  >nals  of  the  earth  p operty  in  question,  and  consequently  there  is  usually  no  explicit  relation  between  7y 
and  n (r)  Nonlinear  analysis  is  notoriously  difficult,  and  the  solution  proposed  by  Backus  and  Gilbert  is  simply  to 
linearize  the  prob'em.  A model  of  the  earth  is  found  that  fits  the  data  well  (how  this  is  done  will  not  concern  us): 
call  it  and  the  observables  it  predicts  7-(°)  • Then  it  can  be  shown,  by  a process  rather  like  Taylor  series 

expan  on.  that  it  is  usua  ly  possible  to  write 


Yy  ~ 7y^3)  = £m(r)  - w^°)(r)j  Gy(r)  dr  + 0^  jm(r)  - m(°)(r)j  dr 


If  the  second  term  is  neglected,  this  is  a new  linear  problem  with  7y  - 7y(°)  as  the  A observables  and  m(r)  - 
nt(°)(r)  as  the  required  property. 

Such  linearization  is  ci  ;ar‘y  an  approximation,  and  even  if  it  is  a valid  one,  we  have  no  guarantee  that  a “base" 
model  other  that,  m^°Hr)  does  not  exist  that  is  outside  the  scope  of  the  linear  description.  Nonetheless,  a linear 
description  of  the  unce.  nties  in  inverse  models  gives  us  a much  clearer  understanding  than  we  had  before. 


THE  ELECTRICAL  CONDUCTIVITY  PROBLEM 


Physical  Background 

What  are  ‘.he  surface  irCuences  of  electrically  conducting  material  at  great  depths  within  the  earth9  Geomag- 
netic variations  are  modified  by  the  presence  of  a conductor  in  the  following  way;  magnetic  field  fluctuations  cause 
eddy  currents  to  flow  in  the  conductor,  and  these  currents  themselves  give  rise  to  magnetic  fields.  Remarkably, 
potential  thee  -«  g>.es  a method  whereby  a surface  observer  can  distinguish  between  the  driving  field,  caused  outside 
the  earth,  ana  the  induced  field  appearing  in  response  to  the  first.  The  method  requires  the  surface  field  to  be 
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FR£Cl£NC*  *,  m cycles  per  doy  (I09  scale) 

Figure  2.-  Thirty-five  data  points  taken  from  Banks  1 1969) 
showing  the  amplitude  ratio  of  the  vertical  to  horizontal 
field  in  the  frequency  range  0.2  to  0.01  cycle  per  day.  Solid 
line  result  of  theoretical  iterative  model. 
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Figure  3.-  Theoretical  conductivity  model  used  to  fit  the  data 
of  figure  1. 


decomposed  into  spherical  harmonics,  it  has  been  found 
that  the  driving  field  consists  largely  of  an  E = 1 harmonic 
(a  uniform  field)  at  the  longer  periods,  and  a physical  me- 
chanism is  available  to  account  for  this  observation. 

To  obtain  information  about  the  conductivity  from 
10,'  km  to  1000  km  it  is  necessary  to  record  very  slow 
variations  in  the  field  from  about  1 cycle  per  day  to  1 cycle 
per  year.  In  this  frequency  band  there  are  periodic  and 
aperiodic  sources,  both  of  which  can  be  used  for  inversion 
purposes.  But  the  amplitudes  of  the  variations  are  very 
small,  often  only  a fsw  parts  in  10s  when  compared  with 
the  steady  field,  so  that  in  practice  the  application  of 
cross  correlation  and  other  statistical  techniques  is  vital. 

The  theory  is  made  much  easier  if  we  assume  the 
earth  is  spherical  and  that  the  conductivity  varies  only  as 
a function  of  radius.  This  approximation  is  clearly  rather 
poor  near  the  surface,  where  there  are  oceans  and  other 
irregularly  distributed  conductors.  By  concentrating  on 
the  lower  frequencies  we  hope  to  “see  through”  the  sur- 
face layer  and  observe  a region  where  the  assumption  is 
valid.  With  the  further  assumption  that  the  source  field 
is  uniform,  it  turns  out  that  the  key  observation  is  the 
ratio  of  the  amplitudes  of  the  vertical  and  northward 
horizontal  fields  considered  as  a function  of  frequency. 

This  quantity  should  vary  with  position  like  the  sine  of 
the  geomagnetic  latitude;  that  it  does  for  a number  of 
stations  [Banks,  1969]  bears  out  the  validity  of  the  two 
assumptions.  The  geomagnetic  data  published  by  Banks 
were  used  in  the  application  of  the  Backus-Gilbert  theory 
to  the  electrical  conductivity  problem.  These  data  cover 
the  range  1 cycle  pe:  day  to  1 cycle  per  year  from  which 
the  frequency  band  0.2  cycles  per  day  to  0.01  cycles  per 
day  was  extracted  for  purposes  of  the  work  under  dis- 
cussion. In  this  interval  Banks  gave  35  frequencies  at 
which  he  estimated  the  amplitude  ratio  of  the  vertical  to 
horizontal  field  (normalized  by  the  sine  of  the  latitude). 

Of  course,  this  ratio  has  a phase  as  well  as  a magnitude, 
but  the  former  had  a much  larger  uncertainty  and  was 
not  included  in  my  analysis. 

Application  of  Backus-Gilbert  Methud 

Mathematical  details  required  in  finding  the  data- 
kemels  for  electrical  conductivity,  which  is  a nonlinear 
problem,  are  given  by  Parker  [1970] . The  35  data  described 
above  and  their  standard  errors  are  shown  in  figure  2.  An  itera- 


tive scheme  not  discussed  in  this  article  was  used  to  generate  a 
theoretical  model  that  fits  the  observations  to  within  about  one  standard  error:  the  continuous  line  in  figure  2 shows 
the  response  of  the  model,  which  is  shown  in  figure  3.  Such  a simple  curve  is  not  an  adequate  description  of  the  model, 
and  the  ambiguities  due  to  lack  of  resolution  and  lade  of  precision  are  of  considerable  importance. 
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A “tradeoff'  contour  diagram  is  shown  in  figure  4. 

Spread  (the  length  over  which  the  real  conductivity  is 
smoothed  to  obtain  an  estimate)  is  contoured  against 
relative  error  and  position  in  the  earth.  The  Z)2  criter- 
ion described  earlier  was  used  here.  The  diagram  is 
arbitrarily  terminated  at  the  top  since  errors  larger  than 
100%  are  not  very  interesting,  but  the  bottom  of  the 
diagram  is  the  line  of  minimum  error;  no  matter  what 
resolution  we  may  tolerate  this  is  the  smallest  error  pos- 
sible. If  the  relative  error  level  of  0.2  or  20%  is  traced 
across  the  diagram,  it  will  be  seen  that  the  spread  is  un- 
reasonably large  until  r/a  = 0.8  or  so.  Below  this  depth 
our  estimate  is  a virtually  meaningless  average  over  the 
whole  mantle,  made  even  more  valueless  by  the  fact  that 
the  linearisation  approximation  breaks  down  for  such 
large  ranges  of  a.  This  simply  shows  that  we  know 
nothing  about  conductivity  below  r/a  = 0.8. 

Continuing  the  20%  level  across,  we  can  see  that 
resolution  lengths  improve  to  0.07a  or  so  for  rja  between 
0.8S  and  0.95.  If  we  would  like  a more  precise  estimate 
than  one  with  20%  error,  we  must  take  a larger  averag’ng 
length  as  shown. 

The  purpose  here  is  to  demonstrate  how  the  nonuniqueness  information  generated  by  the  Backus-Gilbert 
technique  is  used.  A geophysically  interesting  question  concerns  the  appearance  of  a conductivity  minimum  at 
r/a  = 0 75.  The  tradeoff  diagram  shows  that  the  minimum  is  not  a oelievable  feature  of  the  model.  In  a case 
like  this,  the  errors  ascribed  to  a solution  are  as  important  as  the  solution  itself. 

Discussion 

In  recent  years  our  inverse  problem  has  received  some  ingenious  analytical  attention;  Bailey  [1970)  gives  a 
direct  solution  based  cn  the  application  of  Kronig-Kramers  relations  to  the  Ricatti  equation,  and  Weidelt  1 1 970) 
achieves  the  same  objective  with  the  Geljfand  and  Levitan  [1951]  procedure  of  qua  turn  mechanics.  Both  tech- 
niques assume  the  availability  of  data  that  are  perfectly  accurate  and  cover  all  frequencies.  Before  these  results 
were  known,  of  course,  iterative  model  building  was  used. 

i have  given  here  a very  brief  sketch  of  the  application  of  the  Backus-Gilbert  approach.  1 hope  I have  made 
it  plain  why  it  is  necessary  to  go  beyond  the  production  of  a model  fiti  ng  the  data  to  a description  of  the  uncer- 
tainties; 1 have  indicated  the  form  of  the  description  in  terms  of  a trade-off  diagram,  and  given  examples  of  its 
use.  The  Backus-Gilbert  method  is  a general  theory  and  1 hopr  -/e  shall  see  a wider  use  of  it  in  the  future. 


DISCUSSION 


Newton:  You  seem  to  be  implying  that  the  main  limitation  on  the  resolving  power  comes  from  the  number  of 

observations.  There  is  another  problem  that  has  to  do  with  the  linear  dependence  jf  the  observations.  Your 
example  is  a vivid  example  of  just  that.  It  is  implied  by  the  theory  that  two  observations  are  not  going  to  give 
you  any  more  than  a single  observation.  In  another  case  you  may  not  be  able  to  see  that  so  easily. 

Parker:  This  is  very  true.  But,  if  you  go  through  the  procedure  carefully  it  comes  out  automatically. 

Unidentified  speaker:  Could  you  use  this  theory  to  plan  your  experiment? 

Parker:  Yes.  It  can  be  used  in  that  way,  especially  in  a problem  like  the  conductivity  problem  1 discussed.  Another 

case  is  satellite  planning. 

Lax:  You  mentioned  data  of  the  order  of  S00  or  1000.  In  that  case  there  are  some  numerical  problems  in 

minimizing  a quadratic  form  under  a linear  restriction.  Can  you  comment  on  that? 

Parker:  This  is  a ve*y  important  problem.  The  problem  of  linearizing  very  large  quadratic  forms  is  exactly  the 

same  problem  as  diagonalizing  a large  matrix.  We  try  to  overcome  the  difficulties  by  examining  the  spec'v  .m 
of  the  eigenvalues  you  get.  This  is  feasible. 

Sondhi:  How  do  the  results  depend  on  (he  criterion  you  use  for  the  concentration  of  the  linear  combination  of 

delta  functions? 

Parker:  This  has  not  really  been  explored. 

Sondhi:  Is  there  any  difficulty  in  extending  this  when  the  C’s  are  known  only  at  discrete  values  of  r?  You  have 

a certain  number  of  kernel  functions.  What  if  you  know  the  values  of  the  samples  of  the  kernel  function  only, 
and  not  the  continuous  function? 

Parker:  I think  one  could  formulate  it  that  way.  We  haven’t  in  the  geophysical  sciences. 

King:  I think  you’ve  shown  beautifully  the  limits  of  information  retrieval  using  linear  inversion  techniques.  This 

is  basically  due  to  the  fact  that  there  is  a spread  in  your  weighting  function  that  no  amount  of  lir>  ar  games- 
manship in  terms  of  manipulation  will  surmount.  This  same  restriction  does  not  seem  to  apply  to  nonlinear 
combinations  of  the  observations.  I think  there  is  a limit  in  principle. 

At  jtr:  I think  you  are  quite  correct.  If  the  inverse  problem  is  linear  you  will  gain  nothing  by  using  nonlinear 

averages.  If,  however,  the  inverse  problem  is  nonlinear  there  must  be  available  dettper  insight  using  nonlinear 
theory. 
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This  paper  considers  some  practical  numerical  matters  [Gilbert.  1971  j . This  is  in  effect  an  extended  aiiswei  to 
the  question,  “What  do  you  do  when  you  have  a large  data  set?"  We  first  demonstrate  the  inversion  of  a large  data 
set.  with  its  errors,  »nd  then  discuss  the  tradeoff  curve,  or  resolving  power  calculations.  We  consider  inverse  problems 
of  the  earth,  represented  in  the  idealized  foim  of  a single  scalar  function  of  a single  coordinate  m.  (.v|.  A model  of 
the  earth  that  may  or  may  not  fit  the  data  is  m(.v).  Foi  each  model,  we  calculate  a gross  earth  functional  gj.  which 
is  some  observable  quantity  that  depends  on  the  whole  earth  or  some  fraction  of  the  whole  earth;  7 is  the  observed 
value  of  g;.  the  outcome  of  some  experiment  or  set  of  experiments,  and  Gj  is  the  Frechet  kernel,  or  the  first  deriva 
live,  which  permits  the  effect  6 gj  of  a small  change  in  the  model  bm  to  be  computed; 


I 


6m(x)Gj(x)  dx  = 6 gj 


U) 


In  practice,  almost  all  geophysical  data  are  not  Frechet  derivatives  but  ordinary  differentials,  and  we  consider  only 
ilv'se. 

In  formulating  inverse  problems,  we  want  to  turn  this  relationship  around  and  think  of  6 gj  as  being  the  differ- 
ence between  the  functional  for  the  real  earth  and  the  functional  for  some  model  We  may  regard  the  model  as  a 
starting  model  or  as  a model  in  some  stages  of  an  iterative  procedure,  so  that  the  functional  for  the  real  earth  is  the 
observed  value  of  7 It  is  the  difference  between  the  observed  value  7 ( and  the  calculated  value  of  gj  that  we  call 
6?r 

*>gi  = gi(mE)  ~ gj(m) 


= 7/  ~ gj(" 0 


(-) 


We  know  the  model,  we  don’t  know  the  real  earth,  and  we  seek  to  estimate  this  difference.  It  is  clear  that  we 
have  linearized  the  statement  of  the  problem,  and  because  we  only  have  a finite  number  of  data  that  represent 
linearized  moments  and  the  difference  6m(: c),  the  answer  cannot  possibly  be  unique  except  under  very  pathologi- 
cal circumstances.  This  is  both  good  and  bad.  It  is  bad  because  we  will  never  know  what  the  leal  earth  is  really  like, 
but  is  is  good  because  we  can  explore  this  line  of  uniqueness  to  find  a model,  any  model,  at  the  present  time. 


For  example,  we  can  regard  the  linear  relationship  between  Sg  and  6m  for  all  the  data  as  being  a side  condi- 
tion. or  constraint,  and  make  the  demand  that  we  want  to  find  the  6m  that  is  smallest  in  an  rms  sense: 


(3) 


v 


We  wish  to  minimize  this  quadratic  form,  and  use  those  linear  relationships  between  the  differences  in  the  data  and 
the  differences  in  the  mod»l  as  side  conditions.  This  is  a classical  isoperimetric  problem  in  the  calculus  of  variations, 
and  has  a very  simple  solution.  For  this  particular  minimum  condition,  the  Lagrange-Euler  equations  turn  out  to  be 
algebraic,  which  is  always  nice. 
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Two  other  criteria  may  be  used:  the  flattest  model  of  perturbation  that  fits  the  data 


min 


(4) 


or  the  smoothest  model  of  perturbation  that  fits  the  data 


min 


dx 


(S) 


From  -n  esthetic  point  of  view  we  find  that  the  criterion  (5)  is  the  one  that  gives  us  the  nicest  results,  but  any  one  can 
be  chosen  If  we  use  the  first  criterion,  for  the  smallest  rms  6m,  the  result  of  solving  the  Lagrange-Euler  equation  is: 


6 m(x)  •- 


£»/e/w 


(6) 


where  the  v are  the  Lagrange  multipliers  we  put  in  to  handle  the  side  conditions.  We  find  in  this  case  that  the  per- 
turbation is  just  a linear  combination  of  the  Frechet  kernels.  This  may  be  substituted  back  into  Eq.  (1)  to  give: 


if 


Gj(x)Gf(x)Vf  dx 


(7) 


So  far,  I have  assumed  that  there  are  no  errors  in  the  data.  The  kernel  Gfij  is  a symmetric  matrix  that  is  at  least 
positive  semidefinite,  and  is  positive  definite  if  you  are  lucky.  The  expression  in  Eq.  (7)  is  an  inner  product  of  a very  simple 
kind.  Thus,  these  values  of  v,  may  be  found  by  solving  the  system  of  linear  equations.  Equation  (6)  allows  construc- 
tion of  6 m,  which  when  adaed  to  the  starting  model  gives  a new  model  whose  properties,  as  predicted  by  perturbation 
theory,  agree  with  the  properties  measured. 

Of  course,  since  the  inverse  problem  is  not  linear,  this  process  has  to  be  iterated,  and  we  have  found  that  when 
you  start  one  of  these  iterative  procedures  it  either  Mows  up  instantly  or  converges  very  rapidly. 

We  now  consider  the  effect  of  errors  in  the  data.  Suppose  each  datum  has  a standard  error  o,  and  we  require 
only  that  the  data  be  fitted  to  within  some  band,  say  one  or  two  standard  errors.  Once  again,  the  isoperimetric  prob- 
lem may  be  set  up,  with  the  side  conditions 


*ff 


O; 


r 


6 mQc)G/(x)  dx  < 6g|  + of 


(8) 


and  minimize  the  integral  of  (6m)1 . Once  again  6m  turns  out  to  be  a linear  combination  of  Frechet  kernels,  and 
we  have  to  solve 
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Gj(x)Gj{x)Vj  dx  < Sgj  + 


(l>) 


Now  it  is  more  difficult  to  solve  these  inequalities  than  it  is  to  solve  equations,  and  usually  some  kind  of  linear  pro- 
gramming technique  is  used.  Suppose,  however,  that  we  are  very  fortunate,  and  in  addition  to  having  a matrix  that 
is  not  only  symmetric  and  positive  definite,  we  have  a diagonal  matrix.  Then  solving  Eq.  (l>)  is  very  easy.  In  this 
case,  when  the  G is  orthonormal.  the  integral  of  (6  )2  is  easily  shown  to  be  just  the  sum  of  the  Lagrange  multi- 
pliers 


r 


(6  m)J  dx 


(10) 


To  minimize  tq.  (10).  we  need  only  make  each  of  the  Lagrange  multipliers  as  small  as  possible  3ut>'ect  to  the  side 
conditions,  which  are  now 


\i  = bgt  - a i < v.  < bgj  + Oj  = u 


(II) 


This  is  done  merely  be  taking  r * V!  if  (I  >0.  r = u if  u <0.  and  r = 0 otherwise. 

Because  of  the  ease  of  solving  these  inequalities,  it  is  evident  that  we  need  to  diagonalize  the  matrix 


= f Gj(x ) Gj 


(x)  dx 


(12) 


At  the  same  time  we  should  allow  for  the  possibility  that  each  datum  may  not  have  a single  standarc  error  associated 
with  it.  There  may  be  interactive  errors . . . covariances.  In  addition  to  diagonalizing  the  inner  product  matrix  G 
we  need  to  diagonalize  the  covariance  matrix  V ■ 

The  matrix  G will  always  be  symmetric  positive  semidefinite;  (/  will  always  be  symmetric  positive  semi- 
definite.  and  will  be  positi  'e  definite  if  the  experiments  are  properly  conducted.  Our  problem  is  the  classical  one  of 
diagonalizing  simultaneously  two  quadratic  forms,  both  of  which  are  symmetric,  and  one  of  which  is  positive  defi- 
nite. This  can  readily  be  done  [Whittaker,  1937.  pp.  178-183], 

Let  T be  the  diagonal  zation  transformation 


T - V T'  = E 
T-G  -Tr  = I 


(13) 


wl’cn-  E is  diagonal,  and  i is  the  diagonal  unit  matrix.  The  new  gross  earth  functionals  and  the  new  observables  are 
transformations  of  the  old  ones 
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(14) 


f = Tg 

<t> T • 7 

and  new  Frechet  kernels  F are  generated  by  linear  transformation  of  the  old  ones 


F(x)  = T • G(x)  (15) 

The  standard  errors  of  the  now  independent  data  are  just  the  square  roots  of  the  diagonal  entries  of  the  diagonal 
variance  matrix 


(16) 


The  relationship  between  the  perturbation  of  the  model  and  the  perturbation  of  the  data  kernels  is  now  given  in  terms 
of  the  new  Freche't  kernels  by 


Sfi  = *i  ~ fi  - 


8m(x)Fj(x)  dx 


(17) 


where 

t 

FfWFjix)  dx  = 5,y  (18) 

We  proceed  to  the  construction  of  T in  two  steps.  First,  we  Find  the  orthogonal  matrix  Q which  diagonalizes 
the  covariance  matrix  V. 


I 


V = Q-V‘Qr 


(19) 


In  most  cases  1/  will  already  be  diagnoa),  when  there  are  no  covariances.  A transformed  inner  product  matrix  H is 
then 


H = Q -G  • Q* 


(20) 


Second,  we  start  with  the  quantities  v which  are  the  square  roots  of  the  errors 

v,-J  * Vu  (21) 

and  form  a new  symmetric  matrix  4 by  dividing  the  rows  u . columns  of  H by  the  v 
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The  matrix  A may  now  be  diagonalized  by  an  orthogonal  matrix  R 


CD 


A * R • A • R'  (23) 

The  standard  errors  of  the  new  data  are  just  the  square  roots  of  the  reciprocals  of  A . Now  if  we  P the  following 
nondiagonal  matrix 

Pjj  = SjRjjVj 

then  the  transformation  that  diagonalizes  6 and  V simultaneously  is 

T * D Q 


The  new  error  matrix  is  just 


E = p • u . p'  « A~' 


The  problem  is  now  reduced  to  finding  the  two  orthogonal  mat  rice:  Q and  R,  which  in  most  cases  amounts 
only  to  finding  R.  The  procedure  we  use  for  diagonalizing  those  matrices  is  essentially  a classical,  stable  method,  due 
to  Householder  [Wilkins,  196S,  pp  290-3321 . We  will  return  to  that  after  considering  the  problem  of  tradeoff  curves. 

After  doing  the  construction  problem-creating  a model  that  tits  the  data  by  means  of  the  ideas  given  above 
one  presumably  needs  to  do  resolving  power  calculations,  as  discussed  in  the  preceding  paper  by  Parker.  We  consider 
his  matrix  for  the  “spread”  S and  the  covariance  matrix  V.  Then  W is  a weighted  sum  of  these  matrices 


W - S cos  0 + 1/ sine 


where  a tradeoff  curve  is  generated  as  S runs  from  0 to  it  12.  Generally,  we  want  tr  solve  some  system  of  linear 
equations  like 

W • A « U 


Now  if  both  S and  1/  were  diagonal,  the  equations  could  be  easily  solved.  Since  they  are  both  positive  semi- 
definite,  we  can  go  through  the  same  thing  again,  and  diagonalize  them,  and  turn  this  into  a system  of  independent 
equations. 

In  practice,  we  find  the  orthogonal  transformations  which  diagonalize  'ne  matrices  as  follows.  A matrix  such 
as  A in  Eq.  (23)  is  first  reduced  to  tridiagonal  form  by  Householder’s  method.  The  eigenvalues  of  the  tridiagonal 
matrix  are  the  eigenvalues  of  A and  can  be  found  by  Sturm's  bisection  method.  Finally,  the  eigenvectors  of  the 
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tridiagonal  matrix  are  found  by  Wilkinson's  [1965)  method  of  iterative  improvement,  and  are  transformed  into  the 
eigenvectors  of  A by  the  tridiaguna!iz->tion  transformation.  We  have  found  this  approach  to  be  extremely  stable  and 
rapid. 

Now,  to  solve  a system  of  symmetric  linear  equations  like  this,  about  (l/6)nJ  operations  are  needed.  If  p points 
are  required  to  generate  a tradeoff  curve,  then  (1  Ibrfp  + 0 (n*p)  operations  are  needed.  In  the  diagonalization 
method,  one  needs  (2/3 )n3  operations  to  tridiaganalize  and  another  (2/3 )n3  to  get  the  eigenvalues.  That  is  done 
once  for  the  entire  tradeoff  curve,  and  the  number  of  points  on  the  tradeoff  curve  gives  a further  0 (n2p)  operatior  . 

If  p is  large,  it  is  possible  that  the  number  of  operations  required  to  solve  the  system  of  linear  equations  will  be  larger 
than  the  cost  of  diagonalizing  once  and  for  all  for  each  tradeoff  curve  and  then  doing  n2  operations.  For  n g.  ter 
than  p that  is,  when  the  data  set  is  larger  than  the  number  of  points  needed  on  the  tradeoff  curve  then  the  utio  of 
operation;  is  pi  8 (in  our  applications  p is  about  25  and  n is  '00  nr  more).  In  other  wort's,  if  therc  are  more  than 
eight  points  on  the  traisoff  curve  it  is  more  efficient  to  diagonalize  frst  instead  of  solving  the  system  of  linear  equa- 
tions. That  is  for  a single  tradeoff  curve,  so  it  is  always  more  efficient  to  diagonalize. 

If  the  smoothness  criterion  ...  the  D7  criterion  discusses  by  Parker,  S must  be  rediagonalized  for  every  rn 
changes.  If  the  D\  criterionisused.no  r0  appears,  and  the  matrix  S tak.s  the  same  form  for  til  radii,  so  diagorah 
zation  is  done  once,  not  only  for  each  tradeoff  curve,  but  for  all  of  them.  If  there  are  C curves  then  t*-  diagonaliza- 
tion meth  id  is  pCI 8 more  evident  with  the  use  of  the  D ( criterion.  If,  for  example,  theie  are  24  points  on  a curve 
and  50  curves,  it  is  150  times  more  efficient,  which  is  not  trivial  financially.  To  do  the  equivalent  of  300  data.  Wi.n  24 
points  per  curve  and  50  curves,  takes  about  410  sec.  on  a CDC  6600,  which  is  not  expensive.  This  kind  of  calculation 
is  not  done  very  often. 

’’he  actual  procedure  we  use  is  to  make  the  model  fit  the  data  after  diagonalizing  everything  ...  the  constructive 
part  of  the  problem.  The  data  are  culled,  by  accepting  only  those  relative  standard  errors  that  are  less  then  100%.  In 
most  geophysical  problems,  the  effect  is  to  reduce  the  amount  of  data  from  about  300  to  about  30.  We  have  termed 
this  redundancy.  But  those  30  represent  the  projection  of  the  contributions  of  all  of  the  300  data  into  that  good  data 
set.  This  turns  out  to  be  an  extremely  efficient  and  rapid  procedure.  Typically,  when  a model  agrees  with  the  data 
within  a few  percent,  and  we  wish  to  get  it  to  agree  within  0.2%,  two  iterations  are  suTicient  for  300  data. 

In  doing  the  tradeoft  curves,  we  do  exactly  the  same  thing.  What  the  large  errors  do,  after  diagonalization  of 
these  300  forms,  is  essentially  allow  you,  at  an  extreme  price,  to  narrow  down  the  averaging  width  a little.  The  extreme 
price  is  enormous  errors.  <n  a practical  case  all  the  errors  greater  than  100%  are  discarded  and  we  forget  about  the  top 
part  of  the  tradeoff  curve. 

This  is  a v<  ry  simple,  practical  scheme.  We  have  spent  a lot  of  time  coding  these  matrix  operations  for  efficiency. 

I followed  all  the  recommendations  that  Wilkinson  has  made  in  his  book  on  the  algebraic  eigenvalue  problem,  plus  one 
or  two  of  our  own.  We  store  the  matrix  in  linear,  diagonal  form,  and  operate  successively  from  one  column  to  thr  next. 
So.  in  fact,  so  far  as  the  computer  is  concerned,  we  start  with  cell  1 , go  through  sequentially,  and  wind  up  with  cell 
number  n(n  + 1 )/ 2.  Computers  like  this,  so  the  efficiency  goes  up  enormously. 
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DISCUSSION 


Falcone:  What  is  the  form  of  the  Frechet  kernel  Gj? 

Gilbert:  It  is  a function  of  radius  that  may  have  one  or  two  step  discontinuities.  Of  course,  it  is  represented  as  a 

set  of  points  for  machine  calculations.  Normally  it  is  a linear  combination  of  squares  or  cross  products  of 
eigenfunctions. 

Unidentified  speaker:  Did  you  say  you  could  diagonalize  a positive  semidefinite  matrix'.’ 

Gilbert:  Actually,  from  the  point  of  view  of  the  computer,  these  matrices  are  extremely  singular.  What  is  done, 

with  the  Sturm  bisection  method,  is  to  find  the  largest  eigenvalue,  the  next  largest,  and  so  on.  until  one  with 
value  1 or  less  occurs.  Since  this  implies  a relative  error  of  100%,  the  calculation  is  terminated. 

Unidentified  speaker:  You  implied  that  you  could  diagonalize  a matrix  in  a finite  number  of  steps.  Doesn’t 

finding  the  eigenvalues,  the  roots  of  a polynomial,  take  an  infinite  number  of  steps? 


Gilbert:  In  practice,  of  course,  we  quit  when  the  relative  value  is  known  to  10  14 . That  figure  of (2/3)»3  referred 

to  the  number  of  steps  needed  to  tridagouuhze.  The  bisection  method  then  completes  the  process  rapidly. 

Moses:  Could  this  be  used  for  Fourier  transformation?  You  might  have  data  with  errors,  for  whL'r  you  seek  the 

Fourier  transform. 

Gilbert:  Yes.  As  a matter  of  fact,  you  could  certainly  look  at  the  problem  of  digital  Fourier  analysis  as  being  the 

inverse  problem  of  trying  to  find  the  spectrum,  given  the  time  function. 

Thellman:  Aside  from  the  question  of  the  size  of  the  standard  errors,  we  found  that  near  zero  eigenvalues  were 

associate  ./ith  too  much  covariance  between  points  in  the  model,  which  implies  too  much  resolution. 

Gilbert:  If  you  go  ahead  and  diagonalize  the  covariance  matrix  by  a simple  orthogonal  transformation,  you  are 

projecting  into  a new  orthogonal  space,  where  your  data  can  be  ranked  a .cording  to  standard  error.  If  you 
have  too  many  data  because  you  made  too  many  overlapping  measurements,  you  make  this  projection  and 
look  at  the  standard  error.  If  it  begins  to  build  up,  you  just  cut  off  the  data  set. 

Unidentified  speaker:  Adding  on  data  that  have  large  standard  errors  should  not  make  the  errors  wider  in  the  model. 

It  might  make  the  mathematics  easier  if  you  throw  it  away. 

Gilbert:  That  is  quite  true.  There  is  nothing  wrong  with  having  a completely  redundant  set  of  data.  In  fact,  one 

of  the  easiest  ways  in  a big  machine  piogram  to  substitute  a new  datum  for  an  old  datum  is  to  just  read  it  in. 
because  the  old  one  automatically  gets  shoved  down  in  the  eigenvalue  stack.  We  have  a redundancy  factor  of 
something  like  10  or  15-with  300  data,  we  come  up  with  20  or  30  significant  data. 
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The  inverse  problem  for  gross  earth  data  can  be  reduced  to  an  undertermined  linear 
of  the  first  kind.  Discussed  in  this  paper  is  a theory  for  computing  particular  solutions  to 
the  stochastic  inverse  theory  presented  by  Franklin.  The  stochastic  inverse  is  derived  and 
inverse  of  Penrose-  and  Moore.  A Backus-Gilbert  type  tradeoff  curve  is  constructed  for  the  problem  of  estimating  the 
solution  to  the  linear  system  in  the  presence  of  noise.  It  is  shown  that  the  stochastic  inverse  represents  an  optimal 
point  on  this  tradeoff  curve.  In  the  appendix  a useful  form  of  the  solution  autocorrelation  operator  as  a member  of 
a one-parameter  family  of  smoothing  operators  is  derived. 


system  of  integral  equations 
this  linear  system  based  on 
related  to  the  generalized 


INTRODUCTION 


This  paper  addresses  the  problem  of  computing  particular  solutions  to  certain  finite  systems  of  linear  integral 
equations  that  arise  in  the  study  of  the  earth’s  interior.  On  the  earth’s  surface,  quantities  can  be  measured  that  are 
functionals  of  the  distribution  of  physical  parameters  at  depth.  The  earth’s  mass  and  moments  of  inertia,  functionals 
of  the  density  distribution.,  and  tin  travel  times  of  seismic  rays,  functionals  of  the  velocity  distribution,  are  examples 
of  this  kind  of  data.  !«>  essence,  the  geophysical  inverse  problem  is  this:  given  the  observations  of  a finite  number  of 
these  functionals,  what  is  the  distribution  in  the  earth’s  interior  of  the  parameters  or  which  they  depend?  The  formu- 
lation of  the  problem  used  in  this  paper  relates  small  changes  in  a chosen  model  to  small  changes  in  the  data  through 
a linear  transformation.  With  the  use  of  this  first-order  approxinw  i the  problem  is  reduced  to  the  solution  of  an 
ur.uerdetermined  system  of  linear  integral  equations.  Since  observations  are  invariably  contaminated  by  errors  intro- 
duced through  inaccurate  measurement,  only  estimates  of  values  of  the  data  functionals  for  earth  are  available, 
and  estimates  of  the  solution  must  be  made  with  this  realization.  A general  and  extensive  theory  concerning  the  solu- 
tion of  the  linear  geophysical  inverse  problem  for  inaccurately  known  data  has  been  provided  by  Backus  and  Gilbert 
[1967, 1968, 1970] . We  draw  heavily  from  the  concepts  and  terminology  developed  by  these  authors.  Our  concern 
here  is  the  application  of  stochastic  inverse  theory  presented  by  Franklin  [1970]  to  obtain  particular  solutions  to  the 
linear  system.  This  paper  represents  an  expansion  on  the  work  of  Jordan  and  Franklin  [1971  ] . Since  the  geophysical 
inverse  problem  is  an  example  of  a canonical  problem  in  the  mathematics  of  profile  inversion,  the  discussion  is  kept 
general  to  facilitate  other  applications.  Numerical  results  from  the  inversion  of  gross  earth  data  are  the  subject  of  a 
separate  paper  [Jordan  and  Anderson,  in  preparation]  and  are  not  presented  here. 


A model  is  here  defined  as  a real-valued  function  m specified  on  the  closed  interval  [0,1] . In  the  geophysical 
probltm  m represents  the  distribution  of,  say,  density  as  a function  of  rs>dius  in  the  earth.  The  model  space  m is 
the  Hilbert  space  completed  from  the  space  of  real-valued  piecewise  continuous  functions  square  integrable  on  the 
interval  (0,1  ] . The  inner  product  of  a pair  of  functions  m.m'  e m is  written  m • m'  and  taken  equal  to  the  scalar 


value  of  the  integral  £ m(x)m'(x)p(x)  dx.  The  weight  function  p(x)  is  required  to  be  strictly  positive  on 
[0,1].  Associated  with  this  inner  product  is  the  norm  Iml  = (m  • m)I/3.  Given  a linear  operator  L,  mapping 
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M into  itself  and  an  dement  m,  the  vector  L • m in  M whose  value  at  each  point  x equals  the  integral 

f L(x,  x')  m(jc')  p(x')  dx'  can  be  computed.  The  function  L(x,x')  is  the  kernel  of  the  operator  L.  The  kernel 
■/o 

of  the  identity  operator  I is  5(x  -x')/p(jc')  where  the  numerator  is  the  Dirac  delta  distribution.  L*  denotes  the 
transpose  of  L.  If  L = L*,  L is  symmetric. 


A linear  data  functional  ;s  a linear  functional  defined  over  the  model  space  M . (If  a function  is  nonlinear  but 
Frechet  differentiable,  then  it  can  be  approximated  by  a linear  data  functional  at  each  point.)  The  inverse  problen  is 
to  estimate  the  function  m0 , representing  the  actual  density  distribution  in  the  earth,  given  the  observed  values  of  an 
ordered  set  D ^ = {i,(m)  : i = 1,2,... , A/}of  N linear  data  functionals,  say  the  earth's  mass  and  moment  of 
(TV  = 2).  Corresponding  to  the  functional  d,(m)  there  exists  a unique  a,  e M,  the  kernel  of  d,-,  such  that 


a,-  • m = d,(m)  / = 1,2, . . . ,N  (1) 

Each  dj(m)  can  be  considered  as  the  ith  component  of  a vector  d in  the  JV -dimensional  Euclidian  space  E_/V 
The  inner  product  between  two  vectors  d and  d'  in  is  written  dd’  and  equals 


N 

1 

rt=l 


The  inner  product  is  assumed  to  be  dimensionally  homogeneous.  Let  d0  be  the  vector  whose  components  are  the 
observed  data:  then,  if  the  data  are  error-free,  m0  satisfies  the  operator  equation 


A • m0  = d0  (2) 

where  A:M-»E^  is  the  operator  whose  ith  row  is  the  vector  a,-  mapping  M onto  the  subspace  R(A)CE^, 
the  range  space  of  A.  Equation  (2)  implicitly  assumes  that  a0  e R(A). 


THE  GENERALIZED  INVERSE 


For  finite  N the  problem  of  computing  the  solution  to  Eq.  (2)  is  ill-posed  in  the  sense  that  the  solution  is  not 
unique.  In  fact,  A possesses  a null  manifold  N(A)  of  infinite  dimension.  For  each  heN,  A *h  = 0.  If 
hn  : n = 1 , 2,  3, . . . is  basis  of  N(A),  then  the  general  solution  to  Eq.  (2)  tan  be  written 


m = m + 


(3) 


where  m is  any  particular  solution  to  Eq.  (2)  and  the  coefficients  an  are  arbitrary  teal  numbers.  One  particular 
solution  of  interest  is  given  by  the  generalized  inverse  of  A \Moore,  1920;  Bjerhammxr,  1952;  Penrose,  1955; 
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7 'Seng,  1949] . R(A*>  is  the  space  spanned  by  the  set  )aj.i  = 1,2 so  M = N(A)  ® R(A*).  T’c  oper- 
ator A has  a unique  generalized  inverse  A^  : -►  M such  that 


A ‘ A+  " PR(A)  * 


At  A 


PR(A*> 


(4) 


where  Pr(A):  R(A)  and  Pr(a*)  : ^ R(A*)  are  orthogonal  projection  operators  (that  is,  P • P = P and 

P*  = P).  The  estimate 


m = At  d0  (5) 

is  the  unique  solution  that  minimizes  the  norm  1ml.  Using  Eqs  (2)  and  (4)  we  write 

m = A+  A - mo  = Pr(a*)  ' "V)  (6) 

Thus,  this  solution  corresponds  to  the  orthogonal  projection  of  any  solution  onto  the  subscape  R(A*>-  It  is  easily 
shown  that  At  = A*  (A  • A*)t,  reducing  the  computation  of  At  to  the  determination  of  the  generalized  inverse  of 
a symmetric  operator  on  E . If  A has  rank  N,  then  (A  * A*)t  = (A  • A*)-1  since  A • A*  is  nonsingular.  If  A 
has  rank  less  than  N,  other  algorithms  for  computing  (A  • A*)t  can  be  used,  such  as  that  of  Ben-lsrael  and  Chames, 
[1963]. 

The  form  of  Eq.  (6)  illustrates  an  important  point:  since  the  data  kernels  do  not  form  a complete  set,  the  value 
mo(x)  cannot  be  determined  at  each  x e [0,1] ; rather  a linear  average  of  mo  (x)  given  by 


S(x)  * f P R(A*)  (x-  *')  “o  (x  ) P (x)  dx’  (7) 


is  obtained.  Usually,  it  is  desirable  to  compute  an  avenge  which  is  at  least  localized  at  each  point.  Roughly  speaking, 
an  average  is  said  to  be  localized  if  the  contributions  to  the  integral  (7)  are  small  away  from  the  point  x . The  concent 
of  the  localized  linear  average  is  of  central  importance  in  the  Backus-Gilbert  theory  for  the  resolving  power  of  a linear 
system  [Backus  and  Gilbert , 19bo] . In  practice,  these  considerations  can  limit  the  usefulness  of  the  generalized  in- 
verse Backus  and  Gilbert  [1968]  examined  the  kernel  of  Pr(a*)  •n^  found  that,  for  typical  sets  of  eig  nfrequency 
data,  the  linear  averaging  associated  with  this  projection  was  not  sufficiently  localized  to  provide  useful  estimates  of 
the  density  in  the  earth.  Furtnermore,  Eq.  (5)  was  derived  under  the  assumption  that  the  data  are  perfectly  well-known. 
Actually  do  is  only  an  estimate  of  the  vector  A * mo  that  has  been  corrupted  by  error  or  “noise”  entering  through 
observational  errors  and  computational  inaccuracies.  Neglecting  this  error  can  yield  model  estimates  with  large  statis 
tical  uncertainties  [Backus  and  Gilbert,  1970] . 

Nevertheless,  solution  of  the  perturbation  equations  by  the  minimization  of  a specified  norm  is  an  appealing 
method  for  selecting  particular  solutions  to  Eq.  (2).  For  instance,  if  on  the  basis  of  auxiliary  information  there  is 
good  reason  for  believing  that  the  representation  nq,  is  somehow  dose  to  a particular  starting  mode),  such,  a minimi- 
zation can  provide  the  smallest  perturbation  available-in  a norm  sense- that  satisfies  Eq.  (2).  The  limitations  of  the 
generalized  inverse  can  be  overcome  by  appealing  to  a stochastic  formulation  of  this  linear  problem  [Franklin,  1970] . 
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THE  STOCHASTIC  INVERSE 


The  equation  corresponding  to  Eq.  (2)  for  inaccurate  data  can  be  cast  in  the  form 

A • mo  + n = d0  (8) 

where  n e is  a vector  containing  the  noise  components.  Since  these  components  have  some  unknown  scalar  value, 
the  error  is  described  only  in  terms  of  its  statistics.  Treated  here  is  the  case  for  Gaussian  noise  with  zero 
mean.  Following  Franklin  [1970]  we  consider  Eq.  (8)  to  be  a rtmple  of  the  stochastic  equation 

A • Ps  + Pn  = Pd  <9> 

where  p{  is  a stochastic  process  describing  the  solution  and  is  defined  over  M , and  p„  is  the  noise  process  and  pd 
the  data  'jrocess,  both  defined  over  E^.  The  problem  is  to  construct  the  best  linear  unbiased  estimate  pj  of  the 
solution  process  pj  resulting  from  the  application  of  some  linear  operator  B to  the  data  process  pd : 

Ps  = Bpd  (10) 

To  ensure  that  this  estimate  is  unbiased,  ps  and  p„  must  have  zero  expectation. 

The  process  ps  has  zero  expectation  if  the  expected  value  of  the  random  variable  ps  * g,  denoted  E [pj  • g] , 
is  zero  for  all  g in  the  model  space.  So  long  as  convergence  in  the  quadratic  sense  is  sufficient,  then  ps  c«n  be 
represented  by  the  decomposition: 


w 

£ a«f« 


where  |fn  : n * 1 , 2, 3, . . . } is  some  orthonormal  basis  for  M and  j an  : n - 1 , 2, 3, . . .}  is  a set  of  orthogonal 
Gaussian  random  variables.  The  autocorrelation  operator  of  the  process  px  is  the  symmetric  operator  mapping 
the  model  space  into  itself  such  that  g • C$s  * g = £ [(pj  • g)2J  for  all  gc  M.  The  term  on  the  right-hand  side  is 
the  variance  of  the  random  variable  ps  • g;  therefore,  Cjj  is  positive  semidefmite.  By  the  Karhunin-Loeve  theorem 
[Lo'eve,  1955,  p.  478] , jf„(  are  the  eigenfunctions  of  CJS  (taken  here  to  be  normalized)  and  j an2  : n - 1,  2,  3, 

. . . J , the  variances  of  the  an , are  its  eigenvalues 


Cjs  = y &n2  fn  fn 


The  linear  operator  fn  tn  is  defined  by  ' fB)*g  * ' *)•  If  the  spectral  coefficients  an*  equal  unity 

for  all  rt , then  Eq.  (11)  reduces  to  the  completeness  relation 
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02) 


1 = 


fn  fn 


Correspondingly,  the  solution-data  cross-correlation  operator  Cld : -*M  is  defined  by  the  bilinear  from 

E [("  • pI)(p(j  ~)] . Evidently,  Cjd  = Cdj*.  The  following  expansions  are  obtained  for  the  data  autocorrela- 
tion operator  and  the  solution-data  cross-correlation  operator  from  Eq.  (9) 


^dd  = ^ • A*  + A • Cjn  + Cns  • A*  + Cnn 

Csd  = C*s  * A*  + C & 


Assuming  that  the  solution  and  noise  processes  are  uncorrelated,  Csn  = Cm  - 0,  and  the  above  expressions  re- 
duce to 


Cdd  = A • Cjj  • A*  + C nn 
Cjd  = Cjj  • A 


(13) 


In  this  form,  the  data  autocorrelation  operator  Cdd  is  nonsingular  if  is  positive  definite. 

The  best  linear  estimate  of  the  solution  process  pj  is  the  process  pj  which  minimizes  the  quadratic  form 
eJ(g)  = £ |[(p,  - Ps)-E]*}  for  all  geM.  Substitution  losing  Eq.  (10)  and  expansion  of  the  autocorrelation 
operator  of  ps  - pj  yidds 

«2(l)  ■ f * C„  • g - | • C,dB*  • g - f • BCdj  • g + g • BCddB*  • g 

The  first  and  second  variations  of  the  functional  e2  with  respect  to  a variation  of  the  vector  q = B*  • g <j  fixed) 
arc 


= 2(qCdd  -*-C,d)8q 
«V)  = 8qCdd6q  + 2(qCdd  - g -C,d)  «q2 


The  functional  e2  (g)  is  stationary  if  and  only  if  6 (e2 ) = 0 for  all  arbitrary  variations  6q.  Therefore,  the  linear 
combination  Cddq  - Cdj  • g is  required  to  be  zero  for  all  g e M . Assuming  C 4^  to  be  nonsingular,  this  is  true 
if  and  only  if  B 55  CsdCdd'‘ • For  this  choice  the  second  variation  reduces  to  the  positive  definite  form  62  (e2  )= 
6 q Cdd8  q.  Since  this  stationary  point  is  unique  and  a minimum,  the  best  linear  estimate  of  pj  is 


B*  " c*d  cdd~l  Pd 


For  the  particular  sample  of  the  data  process  do , the  best  estimate  of  n>o  is,  using  Eq.  (13), 


fi  - Cjj  • A*(A  • Cjj  • A*  + CMrl  do 


(15) 


(16) 
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This  stochastic  inverse  is  well  suited  for  numerical  computations.  Its  validity  is  limited  only  by  the  assumptions  that 
the  solution  and  noise  are  uncorrelated  and  that  Cnn  is  positive  definite.  In  practice,  it  is  commonly  assumed  that 
the  error  components  are  themselves  uncorrelated;  then 


Cnn 


o I 


0 


07) 


The  positive  real  number  o,  J is  just  the  variance  of  the  i th  datum.  This  matrix  is  positive  definite,  thus  ensuring 
the  existence  of  C^-1 

Statistical  information  concerning  p$  and  p n contained  in  an  ensemble  of  samples  can  be  used  to  estimate 
their  autocorrelation  operators.  The  variances  in  Eq.  (17),  for  instance,  can  be  estimated  from  the  scatter  in  the  data. 

Information  concerning  the  distribution  of  the  solution  and  noise  processes  has  been  used  in  the  solution  of  the 
linear  estimation  problems  previously.  Equation  (16)  is  analogous  to  the  results  of  Wiener’s  theory  for  construction 
of  an  optimum  infinite-lag  smoothing  filter  [Davenport  and  Root,  1 958 1 and  was  explicitly  obtained  by  Strand  and 
Westwater  [1968] . It  reduces  to  Twomey s [1963]  results  for  the  special  case  Css  = I.  Cnn  = 7!. 

In  the  geophysical  problem  it  occurs  that  no  statistics  for  p$  exist.  Then,  what  is  the  significance  of  charac- 
terizing the  solution  as  a sample  of  a Gaussian  process  with  a zero  expectation  and  an  autocorrelation  operator  Cw? 
To  answer  this  question  we  assume  that  the  noise  is  identically  zero  and  that  both  C$j  and  A • CJS  • A*  are  positive 
definite.  Under  these  conditions  the  best  estimate  Eq.  (16)  reduces  to 


m = CSj  • A* (A  • Cw  • A*)-1  do  (18) 

This  solution  has  a simple  geometrical  interpretation.  Let  M'  be  the  space  whose  elements  are  the  members  of  M 
with  an  inner  product  defined  as  the  positive  definite  bilinear  form 


L , 


m • m = 


f f L(x,  *')  m(x)  m(x')  p(x)  p(x')  dx'dx  (19) 

•'o 


m,  m'eM',  where  L = CM-1  • The  solution  autocorrelation  operator  CM  is  idempotent  with  respect  to  this 
product  (Cjj  * Cjj  = Cjj);  it  is  the  identity  operator  on  M'.  Define  A':M'-+E^  as  the  linear  operator  A • Css. 
The  elements  of  N(A')  are  the  same  as  the  elements  of  N(A).  Substituting  A'^m0  for  do  in  Eq.  ( 1 8)  we  obtain 


m = P 


R(A'*) ' m« 


(20) 


where  the  operator  Pr(a'*)  : M'  -*  R(A'*)  is  the  orthogonal  projection  operator  onto  the  range  space  of  A'*,  tn 
this  notation, 


*R(A’*) 


* A'*  (A'  ^ A'*)-1 


A' 


(21) 
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Thus,  the  solution  given  by  Eq.  (18)  is  the  symmetric  projection  of  any  solution,  in  particular  ato,  onto  the  manifold 
R(A'*),  in  the  inner  product  being  defined  by  Eq.  (19)  in  terms  of  the  solution  autocorrelation  operator. 

Often  it  is  useful  to  prescribe  Cj*  as  a smoothing  operator  corresponding  to  a priori  assumptions  about  the 
smoothness  of  the  solution.  This  is  a way  to  incorporate  additional  information  about  the  solution  not  contained  in 
the  data  set  D^.  Because  this  point  is  important  for  physical  applications,  it  bean  further  discussion.  To  illustrate 
what  it  means  for  Cjj  to  be  a smoothing  operator  or,  in  other  terminology,  a low-pass  filter,  we  return  to  its  Karhunin- 
Loeve  expansion  Eq.  (1 1).  Assume  that  the  orthonormal  basis  jfn  : n * 1, 2, 3, . . . | has  been  ordered  such  that,  if 
f,  is  a “smoother”  function  than  tj  -ay,  hu  fewer  zero  crossings  in  [0,1  ] -then  i < /.  In  the  appendix  we  con- 
sider an  ordered  complete  set  with  this  property  provided  by  the  eigenfunctions  of  a Sturm-Liouville  system.  We  will 
call  CB  a smoothing  operator  if  a*  > a*  for  / < /.  This  allows  an  ordering  to  be  defined  on  M:  a vector  mt 
is  said  to  be  smoother  than  a vector  m2  if 


L - C*-' 


(22) 


Here,  Iml^  = (m*m)>/a.  Taking  for  mt  the  solution  (18)  and  for  nh  the  solution  (5),  we  have,  by  the  proper- 
ties of  projection,  Imil  > Intjl  and  Im,^  < Intjl^  if  mt  # mj . Therefore,  by  Eq.  (22),  mi  is  smoother 
then  mt  is  smoother  than  m2. 

Also,  numerical  computations  with  sets  of  grou  earth  data  have  diown  that  die  introduction  of  a smoothing 
operator  for  the  signal  autocorrelation  operator  can  bav?  the  effect  of  providing  more  localized  linear  averages  of  the 
representation  mo  than  those  given  by  the  integral  (7). 


THE  TRADEOFF  CURVE 


The  Backus-Gilbert  theory  of  linear  estimation  [Backus  md  Gilbert.  1968, 1970]  suggests  that  for  the  problem 
described  in  the  previous  sectioc-the  estimation  of  a function  mo  given  the  values  of  a set  of  linear  data  functionals 
do  corrupted  by  a random  noise-there  exists  a tradeoff  between  the  ability  to  resohe  the  detail  of  mo  and  the 
reiiabitity  of  the  estimate  R.  In  this  section,  we  investigate  the  optimality  of  the  stochastic  solution  (16)  in  terms 

of  its  relative  position  on  a Backus-Gilbert  type  tradeoff  curve.  The  generalized  inner  product  ~ • ~ and  norm 
l~lL  introduced  in  the  previous  section  will  be  retained  throu^tout  the  analysis,  but  die  use  of  primes  to  distinguish 
product*  is  suspended  by  defining  the  generalized  product  over  M instead  of  M'. 

An  estimate  ft  of  the  function  m«  a defying 

A^mo  + n ■ do  (23) 

is  sought  given  A,  d*  and  the  statistics  of  a Gaussian  noise  process  pN  from  which  n is  a sample.  The  process  pR 
is  assumed  to  have  zero  expectation  and  a positive  definite  autocorrelation  operator  Cm.  The  null  space  N(A)  is 

populated  by  those  members  of  h for  which  A^h  ■ 0.  Since  the  data  contain  no  information  about  the  components 
of  mo  in  the  null  space,  the  estimate  R is  required  to  belong  to  the  ruige  space  R(A*).  This  statement  is  equivalent 
to  constraining  R to  be  a linear  combination  of  the  data  kernels 

R ■ A*b  , for  some  b e (24) 
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The  vector  b is  to  be  determined  by  minimization  of  an  appropriate  scalar  measure  of  the  error  of  estimation  of 
mo , which  we  are  free  to  construct. 

One  obvious  measure  of  the  error  of  estimation  of  m3  is  the  norm  of  the  difference  between  m0  and  m: 

e,3(b)  = Unto  - A*b II ^ (25) 

By  the  orthogonality  of  the  spaces  N(A)  and  R(A*)  it  is  clear  that  the  projection  of  m0  in  the  null  space  con- 
tributes to  61 3 its  full  squared  norm  regardless  of  the  choice  of  b.  In  fact,  et  3 is  minimized  at  the  value 

«PN(A)^mo  f°r  b = (A^A*)^do-  This  is  the  solution  given  by  the  generalized  inverse  Eq.  (18) 

If  the  data  were  perfectly  accurate,  the  best  linear  estimate  of  the  function  m0  would  result  from  the  appli- 
cation of  the  generalized  inverse  of  A to  the  data  sampie  vector  d0.  However,  n =£0  implies  an  uncertainty  in 
do  and,  correspondingly,  in  m.  A measure  of  the  uncertainty  of  any  estimate  of  the  form  Eq.  (24)  due  to  noise  in 
the  data  is  the  variance  e23  of  the  projection  of  pn  onto  b.  By  definition, 

e,3(b)  = bCnnb  (26) 


An  attempt  to  minimize  this  error  with  respect  to  a variation  of  b yields  the  trivial  solution  b = 0. 

In  general,  the  two  measures  of  error  e,2  and  e23  compete:  minimizing  e, 2 alone  results  in  an  estimate 
for  which  e23  is  large,  while  minimizing  e23  without  regard  for  how  well  the  estimate  satisfies  Eq.  (23)  yields  e, 2 
at  its  maximum  value.  To  explore  the  possibilities  for  a compromise,  consider  the  quadratic  measure  of  eiror 


e3(0,b)  = e ! 3 (b)  cos  0 + e22(b)  sin  0 


(27) 


composed  of  a weighted  sum  of  £i3  and  e23.  The  weighting  is  parameterized  by  an  angle  0 that  varies  on  the 
interval  [0,  tr/2],  so  that  e2(0,  b)  = e , 3 (b)  and  e3(/r/2,  b)  * e23(b).  For  a given  fixed  0,  e3(0,b)  can  be 
minimized  with  respect  to  a variation  of  b.  The  unique  b that  minimizes  e2(0,  b)  is 


b(0)  = (A- A*  + tan  0 Cnw)-1  A • mo 


(28) 


Replacing  the  vector  A^nto  by  its  estimate  d0  and  substituting  Eq.  (28)  into  (24)  gives  the  best  estimate  of  mo  as 

m(0)  = A*(A^A*  + tan0CMnr' d0  (29) 

Special  cases  of  Eq.  (29)  include  the  generalized  inverse  (0  * 0)  and  the  stochastic  inverse  (0  = jr/4). 

The  solution  b'(0)  can  be  put  into  Eqs.  (25)  and  (26)  to  obtain  Ci3  and  e22  as  functions  of  0 


e'i3(0)  ■ IniolL2  + lR(0)^ir.  "L3  - 2nio^R(0)^mo 
e23(0)  - mo^A*Q-‘(0)  Cm  Q"’(0)  A^nto 


(30) 
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The  operator!  Q(0)  and  R(0)  appearing  in  these  expressions  are  defined  by 


Q(0)  « A^A*  + tan  8 CM 
R(0)  - A*  <T*(0)  A 


(31) 


Equations  (30)  determine  a curve  parameterized  by  0 in  the  eia  - eja  plane.  Backus  and  Gilbert  [1970]  have 
termed  such  graphs  tradeoff  curves.  One  can  easily  show  the  following: 


— [*ia]  > 0 

a* 

~ [«a3]  < 0 
30  1 1 


d[£a *] 

d[e,J] 

da[eaa] 

d[e,a]a 


* - cot  0 


>0  0 < 0 < all 


From  these  relations  we  infer  that  the  tradeoff  curve  between  eta  and  eaa  is  monotonically  decreasing  and  convex 
towards  the  origin  and  that  0 is  the  acute  angle  between  the  tangent  to  the  tradeoff  curve  and  the  eaa  axis.  These 
and  other  qualitative  features  of  the  tradeoff  curve  have  been  given  by  Backus  and  Gilbert  [1970] , who  point  out 
that  is  is  best  to  avoid  solutions  corresponding  to  extremal  values  of  0 . Clearly , the  generalized  inverse  solution  is 
a poor  choice  on  the  tradeoff  curve.  The  optimal  point  is  where  d[eja]  /d[eia]  * -1  corresponding  to  the 
solution  Eq.  (16)  given  by  the  stochastic  inverse. 
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APPENDIX 


Construction  of  the  operator  Csi 


In  this  appendix  we  present  a useful  form  of  the  solution  auto-correlation  operator  CSJ  based  on  its  Karhunin- 
Loeve  expansion  Eq.  (1 1).  For  generality  we  restrict  the  definition  to  an  arbitrary  semi-open  subinterval  (a,  ft]  of 
(0, 1],  0 < a < b < 1.  The  construction  yields  Css  as  a member  of  a one-parameter  family  of  smoothing 
operators  with  norm  (and  therefore  gain)  less  than  or  equal  to  one. 

The  operator  CM  is  entirely  determined  by  the  specification  of  an  or thonormal  basis  { fn  : n = 1 , 2, 3, . . . | 
and  a set  of  spectral  coefficients  = 1,2,3,. ..1.  Let  A be  the  Sturm-Liouville  operator:  A(x)  = 

d/d x(p(x)  d/dx]  - q(x),  x e(a,  ft] . The  system  with  homogeneous  boundary  condition; 


A(x)  fix)  + k1  p(x)  f(x) 

f(x)  + Xf(x)l 

J x =a 

f(x)  + H V)| 
b*  Jx=ft 

generates  a set  of  eigenvectors  { f„ } , taken  to  be  normalized,  that  is  complete  on  the  interval  (a,  ft]  [Morse  a J 
Feshback,  1953] . The  eigenvalues  kn2  can  be  ordered  as  a continuously  increasing  sequence;  fc,  2 < k22  < ■ ■ . 

< kn2  < ...  . With  this  ordeiing  the  number  of  nodes  in  the  eigenfunctions  fn  between  a and  ft  also  forms  a 
continuously  increasing  sequence  [Morse  and  Feshbach,  1953,  p.  722] . This  prosides  the  necessary  oidering  of  the 
basis  fn  discussed  in  the  text.  For  a given  scalar  value  of  the  parameter  k we  specify  the  soecKal  coefficients 
by 


= 0, 


= 0 


(Al) 


a 


7 


n 


Ic* 


«*  1,2,3, 


(A2) 


Particularized  in  this  way  C$j  has  the  following  desired  properties: 


(i)  IC„I  < 1 

(ii)  lim  #(I  _ Cjj)  • ml  * 0,  n e M 
*-*•«> 

(iii)  for  k finite,  Ca  behaves  as  a low  pass  filte. . i.e., 

£*iJ  > ctj1  > . . . > an 1 > . . . 
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iWfi 


J.  .. 

'-<fv  . 


Then  the  kernel  of  Ca  satisfies  the  inhomogeneous  system 


1 

1 

*V*) 


1 "1  6(x  - x„) 

A(x)  C„(x0,x)  ■ 


ti 

[i 


p(x)  J 

CSI(x0,x)  + XCj,(xaix) 


p(*) 


]X*fl 


Cn(x0,x)  + /iCM(x0,x) 


)1  - 0 

Jx*i> 


(A3) 


As  an  example  we  solve  this  system  for  the  special  case  p (x)  * p(x)  * x1 , q{x)  • 0.  in  this  case  the  system 
Eq.  (A1 ) has  a singular  point  at  x * 0.  Solutions  to  Eq.  (Al)  are  the  spherical  Bessel  functions  of  angular  order 
zero.  The  solution  to  Eq.  (A3)  is 


Ic  j -*lx-x0l  ] f -k(b-o) 
2xx0 


Cjj(x,x0)  - ie  + “ |^e  cosh k(x0~x) 


+ B cosh  k(a  + b -x0  -x)  + C sinh  k(a  + b 


(A4) 


w=*h 


A * [1  -a(*  + A)j  {1  +h(*-/i)] 

6 = Aa  + fib  - |Aa  - !fc(A  + /i)  + A#t]  ab  - 1 
C ■ k{i>  - a) 

Da  [l  - Aa  - /it  (Au  - A1  )ob)  sinh  k{b  —u) 

-k[b-a  + (n  - A )<ib ) cosh  k(b -a) 

Figure  I dhows  th*  kernel  of  Cj*  on  the  nterval  (0, 1 ] , centered 
at  x0  * 0.5.  Here  we  take  1 In  * 1/A  x 0 and  display  the 
kernel  for  k values  of  5, 20,  and  50. 
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DISCUSSION 


Gross:  Apparently  the  variance  of  what  you  want,  and  the  resolution,  cannot  be  independently  minimized  so  that 

one  forms  a sum  with  a weighting  function  [and  for  this  particular  parameter]  therr  •«  an  infinite  set  of  solu- 
tions, and  one  chooses  the  particular  value  of  interest.  What  confidence  does  one  have  in  this  solution? 

Jordan:  This  particular  solution  represents  a point  on  the  tradeoff  curve.  The  second  minimization  shown,  where 

we  cast  the  measure  of  our  error  in  terms  of  the  sum  of  two  terms,  is  analogous  to  the  same  minimization  that 
Parker  and  Johnson  talked  about.  The  uncertainty  in  the  resolvable  part  of  the  solution  is  given  by  the  second 
term  of  th<s  sum.  The  erro'  due  to  finite  resolution  is  expressed  in  the  first  term.  Gwen  only  the  observations, 
we  can  compute  the  second  term,  but  not  the  first. 
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8.  MISCELLANEOUS  CONTRIBUTIONS 


The  papers  presented  in  this  chapter  are  accounts  of  voluntary  contributions  presented  during  the  two  work- 
shop sessions.  They  expand  on  points  made  in  previous  chapters  or  describe  some  inversion  problems  not  properly 
falling  in  one  of  the  major  disciplines  considered  at  the  Workshop.  The  last  section  of  the  Chapter  is  an  edited 
account  of  the  more  important  discussions  at  the  Workshop  panel  sessions. 

J.  Shmoys  chaired  both  Workshop  sessions.  Hie  published  papers  are  authors’  summaries  of  their  oral  com- 
ments or  edited  transcriptions  of  their  comments. 


SOME  MEfHODS  FOR  DETERMINING  THE  PROFILE 
FUNCTIONS  OF  INHOMOGENEOUS  MEDIA 
R.  Mittra,  D.  H.  Schaubert  and  M.  Mostafavi 
University  of  Illinois 

N73- 116  22  ABSTRACT 

The  problem  of  determining  the  electric  permittivity  of  an  inhomogeneous  dielectric  medium  is  considered. 
Two  analytical  methods  and  one  numerical  method  for  solving  this  problem  are  presented.  All  these  methods  utilize 
the  reflection  coefficient  measured  as  a function  of  the  angular  spectrum  variable  |3.  This  allows  one  to  make 
measurements  at  a fixed  frequency,  and  these  methods,  therefore,  can  be  used  when  the  medium  being  studied  is 
dispensive.  A few  examples  are  provided  and  some  comments  arc  made  regarding  the  stability  of  solutions  obtained 
using  the  various  methods. 


INTRODUCTION 


The  problems  associated  with  making  remote  probing  measurements  are  known  to  workers  in  many  fields. 
Engineers,  physicists,  biologists,  geologists,  ecologists,  and  many  others  have  been  confronted  with  the  problem  of 
determining  the  nature  of  an  object  when  it  is  impossible  or  undersirable  to  place  measuring  devices  directly  at  the 
point  of  interest.  It  then  becomes  necessary  to  make  measurements  at  some  point  removed  from  the  object  and  to 
determine  the  nature  of  the  object  from  these  measurements. 

The  general  remote  probing  problem  is  very  difficult  to  solve.  This  leads  most  authors  to  consider  a simplified 
version  of  the  problem,  that  of  determining  the  characteristics  of  an  inhomogeneous  medium  that  is  uniform  in  two 
dimensions  and  varies  in  the  other  dimension.  Also,  the  variations  are  assumed  to  result  from  only  a few  parameters. 

The  particular  type  of  problem  discussed  here  is  that  of  determining  the  electrical  permittivity  of  a medium 
that  is  uniform  in  the  x and  y directions  and  varies  according  to  the  profile  function  k (z)  in  the  z direction. 

Using  the  techniques  of  Gelfand  and  Levitan  [19SS]  and  Marchenko  [19SS] , many  authors,  including  Becker 
and  Sharpe  [1969]  and  Kay  [1959] , have  reported  the  solution  of  this  type  of  problem.  All  of  their  work,  however, 
requires  that  the  response  of  the  medium  be  known  as  a function  of  the  tempond  frequency.  It  is  also  required  that 
the  medium  itself  be  independent  of  frequency- that  is,  nondispersive.  Often,  however,  it  is  desirable  to  make 
measurements  at  a single  frequency.  Therefore,  in  this  paper  we  consider  the  remote  probing  problem  in  the  angular 
spectrum  domain.  Here  the  independent  variable  0 is  the  spatiul  frequency,  which  can  be  associated  with  k sin  0 
where  k is  the  free  space  wave  number  and  0 is  the  angle  of  incidence  of  a probing  plane  wave. 

The  technique  by  which  the  response  of  the  medium  is  measured  is  to  illuminate  the  medium  using  a spatially 
confined  source.  The  total  field  at  the  interface  is  then  measured.  The  information  is  then  converted  to  the  angular 
spectrum  domain  by  taking  the  Fourier  transform  with  respect  to  the  x direction. 

In  the  next  section,  the  differential  equation  describing  the  problem  is  derived.  Three  different  methods  of 
obtaining  the  profile  are  then  discussed.  The  first  method  makes  use  of  the  form  of  the  equation  in  the  Unfit  of  low 
temporal  frequencies  [ScJwuberf  and  Mittra,  1970] . The  second  method  can  be  used  for  arbitrary  frequency  and  is 
based  on  a new  representation  for  the  wave  function  which  is  appropriate  for  the  angular  spectrum  domain.  The 
third  method  is  a direct  numerical  approach  which  uses  parameter  optimization  techniques  [Mostafavi  and  Mittra, 
1970] . 
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FORMULATION  OF  THE  PROBLEM 


The  geometry  of  the  problem  to  be  considered  is 
shown  in  figure  1 . Note  that  there  are  no  variations  with 
respect  to  the  y direction.  The  dielectric  is  also  uniform 
in  the  x direction  and  varies  in  the  z direction  as  n(z)e o 
where  e0  is  the  permittivity  of  fiee  space.  The  permeabil- 
ity is  assumed  to  be  constant  and  equal  to  that  of  free  space. 
It  is  assumed  that  a known  electromagentic  wave  of  a single 
frequency  is  incident  from  the  left  on  the  air-dielectric 
interface  at  z = 0 and  that  the  tangential  fields  at  the  sur- 
face of  the  dielectric  can  be  measured  for  all  values  of  x. 

Of  course,  the  actual  measurement  could  be  performed  at 
some  plane  z = at,  a<0,  and  then  the  fields  at  the  interface 
could  be  computed  since  the  propagation  in  free  space  is 
completely  known. 


If  the  incident  field  is  polarized  such  that  the  com-  Figure  1.-  Remote  probing  problem  in  the  angular 

ponents  Ex,  Ez,  and  Hy  are  all  zero,  then  propagation  spectrum  domain, 

in  the  medium  is  completely  described  by  the  scalar  equation 


V2v(x,z)  + u2fi0K(z)€0y(x.  z)  = 0 (1) 

(cj  is  the  angular  trequency  of  the  incident  wave).  The  function  v(x,  z ) may  be  identified  with  Ey  or  /^.which- 
ever is  more  convenient  for  the  problem  at  hand. 

To  obtain  the  differential  equation  in  the  angular  spectrum  domain,  we  introduce  the  Fourier  transform 
relations: 


u(fi,z)  = / v(x,  z)  exp  (~jpx)  dx  (2) 

'/_00 

v(x,  z)  * - —J  u(P,z)txp(/Px)dp 

2 ir  J~o° 

Then  Eq.  ("  ’ ecomes 

d + [wjMok(z)co  “ 01]  «(0,z)  = 0 (4) 

This  is  the  differential  equation  we  are  seekina.  Recall  that  w,  and  e0  are  constants,  (3  is  the  independent 
variable,  and  k(z)  is  the  unknown  profile  function. 

METHOD  1:  A LINEAR  INTEGRAL  EQUATION 

In  this  method,  we  assunre  that  ths  medium  is  terminated  at  i = L by  a perfect  electric  conducting  wail  (see 
fig.  2).  This  simplifies  the  expressions  by  causing  some  of  the  boundary  terms  to  vanish. 


We  look  now  at  Eq.  (4)  and  consider  the  special  case 
= 0. 

— PS(fi,z)  = 0 (5) 

dz 2 

Note  that  S((i,z ) is  the  static  solution  for  //2(0,z)  and 
is  independent  of  k(z).  Therefore  S(0,z)  can  be  calculated. 

S(0,z)  = Sosinh[0(z-L)]  (6) 


Figure  2.-  Geometry  of  the  remote  probing  problem  with  a 

perfectly  conducting  plane  at  z = L We  can  now  use  Eqs.  (4),  (5),  and  (6)  and  the  fact  that  both 

u(0,z)  and  S(fi, z)  vanish  at  z = L,  to  arrive  at  the  follow- 
ing integral  equation  for  k (z). 


cp2Mo£o  I k(z)  u(0,z)  sinh[0(z -L)l  dz  = -0  cosh[0L]  u(0,O) -sinh[0I] 

dz 


(7) 


Two  points  should  be  noted  about  Eq.  (7).  First,  the  right-hand  side  is  related  to  the  tangential  fields  at  the 
surface  of  the  dielectric  and,  thus,  can  be  determined  from  the  input  impedance.  Second,  Eq.  (6)  is  nonlinear  be- 
cause u(0,z)  depends  on  k(z).  Therefore,  there  is  no  systematic  procedure  that  can  be  used  to  find  k(z)  from 
Eq.  (7).  To  circumvent  this  problem,  a linear  form  of  the  integral  equation  is  obtained  by  expanding  u(0,  z)  in  a 
power  series  of  to , the  angular  frequency. 


«(0,z) 


00 

£ 


Cw(0,z)con 


C„(0,2) 


1 a"»(0,z) 

n\  3co" 


co  = 0 


(8) 


It  can  be  proven  that  for  w sufficiently  small,  no  surface  wave  of  this  polarization  will  propagate  and  that  Eq.  (8) 
is  a uniformly  convergent  series.  This  allows  us  to  substitute  Eq.  (8)  into  (7)  and  interchange  the  order  of  summation 
and  integration.  Equating  the  coefficients  of  like  powers  of  co  yields  a series  of  equations.  The  co2  term,  in  par- 
ticular yields 


o 


CL 

J K(z)C0  (0,2) sinhl0(z-i)]  dz 


-0cosh[0£]  C,(0,O)  - sinh(0L]  IQ  &.*■?. 


2=0 


(9) 


To  simp'ify  Eq.  (9),  note  that  Co(0,z)  = u(0,z)|  . But,  for  co  = 0,  Eq.  (4)  becomes  identical  with 

I to  = 0 

Eq.  (5),  and  u (0 , z)  becomes  identical  with  S(0 , z).  Therefore  Eq.  (9)  can  be  written 


84 


cL 

Moeo  J k(z)  sinh1 0(z  ~L)  dz  = -0  cosh[0L)  Cj  (0, 0)  - sinh[0I] 


(10) 


I z = 0 


This  is  the  linear  integral  equation  that  k(z)  must  satisfy. 


It  should  be  noted  that  Eq.  (10)  is  exact.  No  approximations  were  made  in  the  expansion  of  u (0,  z)  or  in  the 
matching  of  coefficients.  It  should  also  be  noted  that  the  linear  equation  is  not  obtained  without  cost.  The  right- 
hand  side  of  Eq.  (10)  involves  the  second  derivative  with  respect  to  frequency  of  the  tangential  fields.  This  requires 
that  the  probing  be  done  using  frequencies  in  the  neighborhood  of  zero.  This,  however,  may  not  be  a limitation  since 
some  remote  probing,  such  as  the  determination  of  the  electrical  properties  of  the  earth,  is  often  carried  out  at  very 
low  frequencies. 


Finally,  it  should  be  noted  that  the  satisfaction  of  Eq.  (10)  is  sufficient  to  guarantee  that  the  equations  obtained 
by  matching  other  powers  of  to  are  all  satisfied. 


One  of  the  greatest  advantages  of  the  linear  form  of  the  integral  equation  is  that  the  solution  can  be  shown  to 
be  unique.  To  do  so,  we  must  show  that  the  homogeneous  equation 


| /(z)  sinh*  [0(z-£)]  dz  = 0 

•Jn 


(ii) 


has  only  the  trival  solution.  Rewriting  Eq.  (1 1),  we  have 


if 


/(z)exp[20(z-L)] 


d2tll 


/(z)exp[-20(z-I)] 


(L 

dz  - I /(z)dz  = 0 

j 0 


(12) 


which  says  that  a linear  combination  of  three  terms  must  vanish  for  all  values  of  0.  Clearly  the  last  term  in  Eq.  (12) 
is  independent  of  0;  considering  its  first  term,  if  l/(z)i  < M for  0 < z < L , 


J j exPl2^z  “£)1  dz  J < M 2-^ 


(13) 


Therefore,  for  large  0,  the  first  integral  is  of  the  order  of  0”‘ . Now  consider  the  second  term  it.  Eq.  (12). 


/L/(2)exp[-20(z-I)]  d z = exp[20L]  fLf(z)  exp  [-20z]  dz 


(14) 


The  integral  on  the  right-hand  side  is  die  finite  Laplace  transform  integral,  tkuce,  if  /(z)  is  a Laplace  transformable 
function,  the  second  term  of  Eq  (12)  behaves  as  exp[20L]  for  large  0,  since  the  integral  / /(z)exp(-20z)  dz 
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is  algebraic  for  large  0.  Therefore,  Eq.  (12)  can  be  satisfied  for  all  0 only  if  f(z)  is  identically  zero.  Hence,  the 
solution  of  Eq.  (10)  is  unique  within  the  class  of  bounded,  Laplace  transformable  functions. 

A final  comment  about  the  numerical  behavior  of  the  solution  of  Eq.  (10)  is  needed.  The  matrices  obtained  by 
various  quadratures  on  Eq.  (10)  were  found  to  be  ill-conditioned.  That  is,  small  deviations  in  the  data  appearing  on 
the  right-hand  side  of  the  equation  lead  to  large  deviations  in  the  calculated  profile  function.  This  limits  the  applica- 
tion of  this  method  in  practical  problems.  However,  Eq.  (10)  was  proven  to  be  nonsingular;  hence,  better  inversion 
methods  than  were  used  in  our  investigation  could  lead  to  usable  results. 


METHOD  2:  A NEW  REPRESENTATION 


Recall  that  method  1 required  the  angular  spectrum  response  of  the  medium  to  be  obtained  for  angular  frequen- 
cies in  the  vicinity  of  go  = 0.  The  method  presented  in  this  section  can  be  used  for  any  fixed  nonzero  value  of  go. 

Before  continuing,  we  introduce  an  equivalent  form  of  Eq.  (4),  which  will  be  used  as  the  basic  differential 
equation  in  this  section 


m"(7.z)  + y2u(y,z)  = Q(z)u(y,z)  (15) 

where 

y2  = k2  - 02 
Q(z ) » ~k2  (k(z)  - 1] 
k2  ~ GOJpoeO 

The  prime  denotes  differentiation  with  respect  to  z.  Note  that  if  we  determine  Q(z),  k (z)  can  easily  be  found. 

We  will  assume  that  Q(z)  is  bounded  and  that 


/ z|g(z)|  dz  < «» 
o 


(16) 


Urn  u(y,z)  = expliyz] 

2 + ot> 


We  then  write  the  solution  to  Eq.  (1 5)  in  the  form 


«(7>*)  * exp[/irz]  + 


1 

2 it/ 


u(tl,z)exp[f(7  + n)z]  (7  + t?)"1  P(n)dt» 


(IT) 


(18) 
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The  associated  profile  function  is  then  giver,  by  the  expression 


Q(z)  = — J [lu'O?, z)  - t ?«(ri,z)]  exp[n?z]?(T?)  dr>  (19) 

m -<*> 

The  function  P(  tj)  in  Eqs.  (lb)  and  (19)  contains  infoimztion  about  the  medium  being  studied.  We  can  determine 
P( rj)  from  a knowledge  of  u(y,o } by  considering  Eq.  (18)  for  z - 0.  That  :s,  the  equation 

u(y,o)  = i + ~ — / u(v,ol(y  + ri)~l  P(ri)  dn  (20) 

2 7 ri 


is  a linear  integral  equation  for  P(v) 

Usin«  these  equations,  we  obtain  the  unknown  functions  Q(z)  by  the  following  steps' 

1.  Knowing  u(y,  o),  solve  Eq.  (20)  for  P[rj), 

2.  For  various  f’xed  values  of  2,  solve  Eq.  (18)  for  u(y,z) 

3.  Obtain  Q(z)  ItomEq.  (19). 

It  can  be  sh'iwn  that  Eq.  (18)  is  a solution  to  (15).  This  is  done  by  substituting  Eq.  ( 1 8)  into  (15),  yielding 
an  integral  equation  of  the  form 


« OO 

T(7,z)=~—  f r(T?,z)exp(r(yi  i?)z]  (T  + n)-1  P(jj!  dr/ 
2 m m-°° 


(21! 


where  T(y,z)  = u"(y,z)  + y2  u(y,z)  - Q(z)  u(y,z).  For  functions  of  the  type  encountered  in  this  problem, 
Eq.  (21)  has  only  the  solution  T{y,z)  s 0.  Therefore,  Fq.  (15)  ts  satisfied. 


Using  ar>  example  due  to  Sharpe  [1963],  the  above  method  was  tested  for  numerical  stability,  n 
this  case,  the  field  the  interface  v.as  given  by 


u(y,o) 


(7  + 0* 

It  + (V7/2)+ 072))  {T-( V7/2)+o/2)] 


(21) 


Figure  3 indicates  the  close  comparison  between  he  calcu- 
lated result  and  the  analytical  result  obtained  by  'marpe. 


METHOD  3:  PARAMETER  OPTIMIZATION 


In  contrast  to  the  two  methods  presented  above,  the 
third  method  is  numerical  in  nature.  This  method  is  more 
general  than  the  previous  ones,  but  it  dies  lack  the  analyt- 
ical niceties  that  are  present  in  those  methods.  For  instance, 


figure  3.  - Results  obtained  when  method  % war  applied 
to  Sharpe's  example. 
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i-i  Xvify 


it  is  impossible  to  prove  uniqueness  of  the  solution.  Nevertheless,  this  method  can  use  data  obtained  in  the  angular 
spectrum  domain  or  the  frequency  domain,  or  both,  and  it  can  be  generalized  to  a wider  class  of  geometries-for 
example,  circular  columns  and  sheath  configurations  in  the  neighborhood  of  space  vehicles.  In  addition,  this  method 
is  numerically  stable. 

The  method  is  based  on  the  parameter  optimization  approach  commonly  used  in  systems  theorv.  The  method 
seeks  to  minimize  a performance  index  function  F,  which  is  defined  as  the  norm  of  the  error  of  the  response . 


F(k,,  Kj,...,Xm) 


P Q 

|u£(Wy,fy,K)  ~ u(cj|',^,X1,K1,...,Km)|1 

M M 


(23) 


where 


Kj  are  unknown  parameters  introduced  to  describe  the  medium, 

ug( coy . &j,  k)  * surface  field  for  the  true  medium  described  by  a profile  function  k(z)  when  measured  for 
angular  frequency  coy  and  spatial  frequency  . 

Kt  ,k3  , . . . ,Km)  = surface  field  for  a trial  medium  described  by  the  parameters  Ki,k2 nm 

when  measured  for  angular  frequency  toy  and  spatial  frequency  fy. 


figure  1-  Profiles  v f .'electric  constant  k(i)  “ for  a 
slab  terminated  by  a perfect  conductor  at  t ■ 1. 
Rotentrock’t  method. 


The  terms  ur (coy , 0y , x ) are  the  measured  values  of  the 
field  for  the  medium  being  investigated. 

The  strategy  of  the  method  is  as  follows.  One  begins 
r,ith  a continuous  or  layered  model  for  the  medium  involv- 
ing the  parameters  x t } , . . . , xm . With  the  parameters 
set  equal  to  some  reasonable  values,  the  response  of  this 
trial  medium  for  each  coy  and  (5y  is  calculated  using  Eq. 

(4)  and  some  standard  numerical  technique  for  solving 
differential  equations.  This  response  is  the  u(t oy,fy,x., 

X] , . . . , xm)  term  in  Eq.  (23).  Once  the  performance 
index  F is  calculated,  the  optimization  algorithm  deter- 
mines a new  set  of  parameter  values  and  the  process  is 
repeated  until  a preset  stopping  criterion  is  satisfied-for 
example,  F is  less  than  a given  error  tolerance  e . 

The  optimization  technique  has  been  applied  to  a 
variety  of  inversion  problems  involving  inhomogeneous 
media  with  continuous  as  well  as  discrete  variations  in 
the  profile  function,  for  both  planar  and  circular  geometries. 

The  numerical  calculations  were  done  using  two  differ- 
ent optimization  algorithms.  Figure  4 shows  the  results 
using  Rosenbrock 's  [I960]  method  for  a profile  func- 
tion k(z)  * exp  (2z).  Figure  5 shows  the  results  obtained 
using  the  conjugate  gradients  method  [Fletcher  and  Reeves, 
1964]  on  the  same  profile.  Figure  6 shows  the  results 
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obtained  using  ihe  conjugate  gradients  method  when 
the  data  was  polluted  by  noise.  The  very  good  agree- 
ment here  indicates  ihat  this  method  is  stable  enough 
to  be  used  with  laboratory  data. 

It  ;an  be  seen  that  the  computed  results  agree 
with  the  true  profile  better  in  the  vicinity  of  z ~ 0. 
Tins  can  t -;  explained  by  thi  fact  the  signal  returned 
from  the  deeper  inside  of  the  medium  (nea  the  per- 
fect conducting  wall)  is  masked  by  traveling  thiough 
the  front  portion  of  the  medium.  Thus,  vacations 
occuring  deep  in  the  medium  have  little  effect  on  the 
fields  measured  at  the  interface.  Figures  7 and  8 show 
the  results  obtai  cd  for  a profile  illuminated  from  the 
left  and  from  the  right,  respectively.  In  both  cases  the 
results  are  better  near  the  interface  than  near  the 
perfect  conductor. 


Figure  5.-  Profile  of  dielectric  constant  * *2*  for 
a slab  terminated  by  a perfect  conductor  at  i > I. 
Conjugate  gradients  method 


Figure  6.  - Profile  of  dielectric  constant  »c{*)  = fora  slab 
terminated  by  a perfect  conductor  at  z * 1 . Random  noise 
was  added  to  the  simulated  data. 


Figure  7.-  Profile  of  dielectric  constant  k(zI  * -(5  ♦ 

+ (4  + \/Tf)z  *2  for  a dab  terminated  by  a perfect  conductor 
at  r«  I.  Plane  wave  wai  incident  ftom  -z  to  ♦*  direction. 
Cojungate  gradients  method,  k,  • 1.0. 
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V'ikjS, 

»”-4  i SS«j» 


Plane  wave  wai  Incident  from  +r  to  -i  direction.  Conjugate 
gradients  method,  i,  ■ 1. 


Figure  9.-  Profile  of  dielectric  constant  k(x)  « i ♦ I tin  2*  for 
a dielectric  slab.  Roaet-.btock't  method,  " i 


Figure  9 is  another  example  in  which  variations 
of  the  profile  function  occurring  deep  in  the  medium 
have  little  effect  on  the  surface  fields.  This  example 
illustrates  the  uncertainty  that  must  be  attached  to 
the  proposed  solutions.  Although  the  results  are  not 
particularly  good,  the  performance  index  for  this  case 
was  very  small,  leu  than  10~} , and  the  amplitudes  of 
the  scattered  fields  agreed  to  two  significant  digits. 


CONCLUSIONS 


The  three  methods  discussed  for  solving  the 
remote  probing  problem  were  developed  especially 
to  utilize  data  obtained  in  the  angular  spectrum 
domain.  The  first  method  is  a novel  technique  for 
extracting  an  exact  linear  integral  equation  from  a 
nonlinear  integral  equation.  The  second  method, 
based  on  a new  representation  of  th  . so'ntion  to  the 
differential  equation  describing  the  problem,  is  more 
stable  numerically  than  the  first. 

The  third  method  was  s'  to  bt  numerical  in 
nature  and  so  lacks  many  of  the  analytical  niceties 
that  the  first  two  methods  possess.  However,  this 
method  can  use  data  from  either  the  frequency  do- 
main or  the  angular  spectrum  domain,  and  it  can  be 
used  for  geometries  other  than  planar.  Furthermore, 
this  method  ii  numerically  stable. 
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DISCUSSION 


Grrssi:  Would  this  be  applicable  to  the  case  of  "crtical  incidence  ionograms,  the  valley  £ and  F layer? 

Mttra:  Yes  With  certain  provisions. 

Shellman:  In  the  last  example,  what  was  the  allowable  class  of  functions? 

Mittra:  We  have  not  found  the  restrictions  yet  that  one  has  to  put 
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* -SPACE  FORMULATION  OF  THE  » -DIMENSIONAL 


SCATTERING  PROBLEM 
Norbert  N.  Bojarskit 


ABSTRACT 


N7  3 “H623 


The  n -dimensional  scattering  problem  is  solved  by  means  of  a * -space  formulation  of  the  field  equations,  thereby 
replacing  the  conventional  integral  equation  formulation  by  a set  of  two  algebraic  equations  in  two  unknowns  in  two 
spaces  (the  constitutive  equation  being  an  algebraic  equation  in  jc -space).  These  equations  are  solved  by  an  iterative 
method  with  the  aid  of  the  fast  Fourier  transform  (FFT)  algorithm  connecting  the  two  spaces,  requiring  very  simple 
initial  approximations.  Since  algebraic  and  FFT  equations  are  used,  the  number  of  arithmetic  multiple-add  operations 
and  storage  allocations  required  for  a numerical  solution  are  reduced  from  the  order  of  N1  (for  solving  the  matrix 
equations  resulting  from  the  conventional  integral  equations)  to  the  order  of  N log,  N and  N,  respectively  (where 
A is  the  number  of  data  points  required  for  the  specification  of  the  problem).  The  advantage  gained  in  speed  and 
storage  Is  thus  of  the  order  of  IV/ log]  N and  N,  respectively.  This  method  is  thus  considerably  more  efficient 
than  the  conventional  matrix  method,  and  permits  exact  numerical  solutions  for  much  large:  problems.  Arguments 
are  presented  towards  the  view  that  the  field  equations  are  more  fundamental  in  fc-space.  The  details  and  some 
numerical  results  of  the  application  of  this  method  to  the  three-dimensional  electromagnetic  scattering  problem  are 
presented  as  an  example. 


INTRODUCTION  AND  BACKGROUND 


Consider  the  n-dimensional  scalar,  vector,  or  tensor  field  fix)  and  the  source  density  g(x) , governed  by  the 
linear  mth  order  differential  field  equation 


L f f{x)  = -L*  *(x)  (1) 

where  the  n-dimensional  linear  differential  scalar,  vector,  or  tensor  operators  L f and  L g are  of  the  form 


m 
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subject  to  the  n-dimensional  scalar,  vector,  or  tensor  constitutive  equation 


?(*)  = h(x)f(x) 


(4) 


The  conventional  n-dimensional  integral  representation  of  the  generalized  initial  va-ue  or  (scattering)  problem 
associated  with  the  field  Eq.  (1)  is 


f{x)  = j^G(x  |x')g(x')  dV  * fl(x)  (5) 

subject  to  the  constitutive  Eq.  (4);  where  fix)  is  the  externally  imposed  field  [initial  value,  source  distribution 
external  to  the  problem  defined  by  the  constitutive  Eq.  (4)] ; D is  the  domain  of  nonvanishing  h{x) ; and  G(x) 
is  the  appropriate  Green’s  function,  vector,  or  tensor,  satisfying  the  differential  equation 


Lf  G(x)=  -L g 6(x) 


(6) 


The  conventional  numerical  method  of  solution  of  this  initial  value  problem  is  by  means  of  numerical  matrix 
inversion  methods  Harrington  [1968] , applied  to  the  Fredholm  integral  equation  of  the  second  kind,  formed  by 
combining  Eqs.  (4)  and  (5);  that  is. 


fix)- 


L 


Kix  lx')  fix’)  dnx'  = fix) 


(V 


where  the  integral  transform  kernel  A'(x  |x')  is  given  by 

Kix  lx')  = G(x|x')A(x') 


(8) 


Such  matrix  inversion  methods  require  die  order  of  N2  computer  storage  allocations,  and  of  the  order  of 
from  <VZ  to  N*  arithmetic  muitipiy-add  operations  (for  conventional  matrix  inversions  and  matrix  inversions  by 
iteration  method:,  respectively),  for  the  execution  of  a numerical  solution;  N is  the  number  of  data  points  required 
for  the  numerical  specification  of  the  constitutive  Eq.  (4),  the  specification  of  the  nonvanishing  portion  of  h(x) . 
The  practical  size  limit  with  state-of-the-art  computers  is  for  N of  the  order  of  several  hundred. 


THE  E-SPACE  FORMULATION  OF  THE  GENERAL  INITIAL-VALUE  PROBLEM 


The  E-space  representation  and  solution  of  the  generalized  n-dimensional  initial-value  problem  is  discussed. 
The  n-dimensional  Fourier  Transform  of  the  differential  field  Eq.  (1)  yields  the  algebraic  scalar,  vector,  or  tensor 
E-space  equation 


Ljik)  F{k)  = - Lgik)  Gik) 


(9) 
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where 


F(k) 


eik'X  f(x)  d”x 


00) 


and  where,  by  virtue  of  Eqs.  (2)  and  (3),  the  quantities  Lf(k)  and  Lg{k)  are  polynomials  of  the  form 


Lf(k) 


Lg(k) 


b^ik)* 


(id 


(12) 


The  A: -space  representation  of  the  generalized  n-dimensional  initial  value  problem,  consistent  with  the  x-space 
integral  representation  in  Eq.  (5),  thus  is  the  algebraic  scalar,  vector,  or  tensor  equation  (vis-a-vis  the  conventional 
integral  or  differential  equation) 


F(k)  = r(Jfc)  G{k)  + F'(ifc) 


(13) 


subject  to  the  algebraic  x-space  constitutive  Eq.  (4);  that  is. 


g(x)  = /t(x)  /(*) 


where 


r(*)  s 


Lg(k) 

Lfik) 


(14) 


(15) 


which  clearly  can  be  taken  as  the  Green’s  function  (or  vector  or  trusor)  in  k-space,  that  is. 


r (k)~  -G(x) 


(16) 


The  generalized  n-dimensional  initial  value  problem  is  thus  reduced  to  a set  of  two  algebraic  (scalar,  vector,  or 
tensor)  equations  in  two  unknowns  in  two  spaces  - that  is,  Eqs.  (13)  and  (14). 

The  unique  existence  [Pbpoulis,  1962]  of  this  k-space  representation  is  restricted  to  media  for  which 


I drtx  < *» 


(17) 
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If  h(x)  is  in  genera!  norvanishing  only  in  a finite  rt -dimensional  x -domain,  then  the  pair  of  algebraic  equations 
(13)  and  (’4)  can  be  solved  numerically  with  the  aid  of  the  n-dimensional  fast  Fourier  transform  (FFT)  algorithm 
[Cooley  et  at.,  1967]  as  the  connection  between  the  two  spaces,  by  the  following  iterative  relaxation  method  [ Varga, 
1962] , the  recursion  relationship  for  which  is 


Gn(k ) * F(k\i)gn{x) 

Fn(k)  - r (k)  G„(k)  + FUk) 
f„{x)  = F(x  Ik)  F„(k) 

£n+l(*l  = a*(x)fn(x)  + (1  -«)£n(*)  (18) 


where  a is  an  appropriately  chosen  relaxation  coefficient  (best  numerical  results  to  date  were  obtained  for  o = 1 / 2), 
and  where  F(k  I x)  and  F(x  I k)  designate  the  fast  FourieT  transform  algorithm  operator  and  its  inverse, 
respectively. 

The  initial  approximation  g0(*)  can  be  taken  as  any  known  simply  programmable  approximation  to  the 
problem,  including  the  trivial  case  g0(x)  = 0. 

To  avoid  the  numerical  difficulties  arising  from  the  fictitious  periodic  nature  of  the  FFT,  which  is  a discrete 
and  finite  Fourier  transform,  Vis-a-vis  the  continuous  and  infinite  Fourier  transform  implied  by  Eq.  (10),  and  the 
possible  singularities  in  the  fields,  sources,  and  Green’s  function  (or  vector  or  tensor)  in  k-space,  it  becomes  necessary 
to  choose  an  n-dimensional  hyper-rectangular  box  of  twice  the  size  (in  each  dimension)  of  the  smallest  hyper-rectan- 
gular box  in  which  the  nonvanishing  h(x)  is  imbeddable  as  thex  domain  for  the  FFT,  and  take  the  Green’s  function, 
vector,  or  tensor  as 


Nt 

2 


-1 


rGi,*>, . . . ,o)  = -A”x 


X)  'll  l^lV*"-V’TGW 


y) 


Nn 


y- 


(19) 


in  the  conventional  TFT  notation  [Cooky  <•«  at.,  1967]  where 


- f.  Ixi  1 N 

(20) 

Anx  = Ax,  Axa  • • • Axn 

(21) 

JV  = Ni  A*  • - • ,Y„ 

*22) 
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and  where  appropriate  use  is  made  of  the  periodicity  properties  of  the  FFT  [Cooley  et  al.,  1967]  for  both  Eq.  (19) 
and  the  desired  placement  of  h(x)  and  f‘(x)  in  the  hyper-rectangular  FFT  x domain. 

The  numerical  difficulty  arising  from  the  possible  singularity  of  the  Green’s  function  at  the  origin  of  the  x space 
(that  is,  G(0)  = °°)  can  be  alleviated  by  taking  advantage  of  the  appropriate  principal  value  integral  representation 
of  the  field  Eq.  (5),  such  as,  [Kan  Bladel,  1961] 


f{x)  = Bg(x)  + G(x  I x')  g(x')  dV 
It  thus  immediately  follows  that  G(0)  for  Eq.  (19)  can  be  taken  as 


(23) 


G(0)  = 


g 

A«x 


(24) 


Since  algebraic  and  FFT  equations  are  used,  the  number  of  arithmetic  multiply-add  operations  required 
[Singleton,  1969]  for  a solution  is  reduced  from  the  order  of  V2  (required  for  solving  the  matrix  equations  result- 
ing from  the  conventional  integral  equation  representation  of  the  problem)  to  the  order  of  N log2  N,  and  the  stor- 
age requirement  is  red  iced  also  from  the  order  of  N2  (required  for  storing  the  matrix  associated  with  the  matrix 
method  of  solution)  to  the  order  of  N (requited  for  storing  the  k-  and  x-space  vectors).  The  advantage  gained  in 
speed  and  storage  is  tl  us  of  the  order  of  A^/logj  TV  and  N,  respectively.  This  method  is  thus  considerably  more 
efficient,  and  permits  exact  numerical  solutions  for  much  larger  problems.  To  date,  problems  of  the  order  of  N = 104 
have  been  successfully  solved;  and  problems  of  the  order  of  106  seem  feasible  with  state-ol-the-art  computers 
[Singleton,  1970]. 


THE  k-SPAC£  FORMULATION  OF  THE  GENERAL  WAVE  SCATTERING  PROBLEM 


For  the  general  n-dimensional  wave  scattering  problem,  the  (range-  and  phase-normalized)  scattered  far  fields 
in  x space  are  in  general  simply  and  algebraically  related  to  the  n-dimension?i  Fourier  transform  of  the  source  distri- 
bution g(x)  - that  is,  G(k)  - induced  by  the  incident  field  f‘(x) ; G(k)  clearly  is  yielded  directly  by  the  iterative 
solution  in  Eq.  (IS)  without  additional  computations.  Since  this  is  not  the  case  with  the  conventional  matrix  method 
of  solution  of  the  integral  equation  representation  of  the  scattering  problem,  the  k-space  method  of  solution  pre- 
sented is  particularly  and  additionally  attractive  when  applied  to  scattering  problems. 

For  the  special  case  of  the  n-dimensional  Helmholtz  (time-reduced)  wave  equations  for  which 


+ k0 2 


(25) 


where  k0  s cj/c  , and  c is  the  wave  velocity  in  free  space  (this  deviation  trom  conventional  notation  is  fcr  the 
purpose  of  distinction  from  k , the  Fourier  transform  variable  of  x),  the  n-dimensional  k-space  Green’s  function, 
in  the  notation  of  (9),  (IS),  and  (16),  is  clearly 
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r(*)  = *(*)  l g(k) 


(26) 


The  form  of  the  Green’s  function  $(&)  in  k space  is  clearly  invariant  to  the  dimensionality  n of  the  space, 
which  is  not  the  case  in  x space. 

Thus,  by  virtue  of  the  previously  stated  relationship  between  the  (range-  and  phase-normalized)  scattered  far 
fields  in  x space  and  the  source  distribution  G(k)  in  k space,  and  conservation-of-energy  considerations  for 
passive  media  of  finite  spatial  extent  [see  the  inequality  (17)] , it  follows  that  the  radiation  condition  for  the  Helmholtz 
equation  in  k space  can  be  stated  as 


Hks)  < 

<X*j) 


I*,  I = * o 


(27) 


where  ks  is  the  propagation  wave  number  vector  of  the  scattered  far  fields;  see  also  (37)  through  (40). 

It  can  thus  be  argued  that  the  field  equations  of  .natncmatical  physics  are  more  fundamental  in  k space,  be- 
cause of  the  simple  algebraic  nature  of  these  equations  in  k -pace,  and  the  invariance  of  the  form  of  the  k space 
Green’s  function  of  the  Helmholtz  equation  with  respect  to  fie  dimensionality  of  the  space,  particularly  when  bearing 
in  mind  that  the  Fourier  transform  is  the  only  transform  known  for  which  a fast  algorithm  exists.  (This  apparent  more 
fundamental  nature  is  most  obvious  for  quantum  mechanics.) 


ELECTROMAGNETIC  SCATTERING 


Three-dimensional  electromagnetic  monochromatic  scattering  by  passive  inhomogeneous  media,  including  per- 
fect conductors,  of  finite  spatial  extent  and  arb^ary  shape,  is  considered. 

The  time-reduced  electric  ai«t  .nagnetic  field  wave  equations,  valid  for  all  linear  inhomogeneous  media,  in  terms 
of  the  total  current  density  [Jirattoib  1941] , are,  respectively. 


V XVX£(x)  - *0*  £(*)  = iWMo  /(*) 

VXV*H(x)  - ko1  H(x ) = VX7(x) 

which,  with  the  aid  of  Maxwell’s  first  and  second  equations,  and  the  equation  of  continuity  for  the  total  charge  and 
current  density,  can  be  written  as 


(28) 

(29) 


VJE  + /to1  E = -iuuo[j  + 


W'J  ) 


(30) 


V'H  + k^  H = -VXy 


(31) 
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For  nonmagnetic  media  and  perfectly  conducting  media,  the  appropriate  constitutive  equations  for  the  total 
volume  and  surface  current  density  J ( x ) and  K(x),  respectively,  are 


J - {Of  - i uxe)  F 

(32) 

K = n X H 

(33) 

where  of  and  xe  are  the  free  charge  conductivity  and  electric  susceptibility,  respectively,  of  a (nonmagnetic)  medium 
and  n is  the  otward  surface  unit  vector  of  a perfectly  conducting  medium. 

Equation  (33)  is  usually  regarded  as  a boundary  condition  for  perfect  conductors,  but  in  the  context  of  this 
paper, it  must  be  taken  as  a constitutive  equation  in  the  truest  sense,  particularly  if  regarded  as  a geometrically  con- 
straining condition  on  the  flow  of  all  charges. 

For  the  electric  and  magnetic  field  equations,  consistent  with  Eqs.  (13)  and  (26)  the  appropirate  k-space  field 
equations  and  Green’s  tensor  and  vector  are,  respectively, 


E(k)  = r(*W(*)  + £'(*) 

r(*)  S o — 

(34) 

H(k)  = r (k)  X J(k)  + W(k) 

f(lt)  = ik  1>(fc) 

(35) 

and,  by  the  relationship  between  surface  and  volume  current  densities,  consistent  with  the  FFT  notation  of  (19), 
the  volume  current  density  for  a perfectly  conducting  medium,  can  be  written  by  Eq.  (33)  as 


J = 


— nXH 
Av 


As 

A3* 


X H 


(36) 


where  As  is  the  (finite)  differential  vector  surface  area  in  the  FFT  cell  of  volume  Av  = A3x . 

Equations  (32),  and  (34)  and  (33),jnd  (35)  can  now  be  solved  numerically  by  Eq.  (18).  Defining  the  range 
and  phase  normalized  scattered  far-fteld  !>(ks)  as 

S(ks)  = _Limit  lx  I Fs(x)  e-,*s  ’ x (37) 

lx  I ■+«• 
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where  F%x)  is  the  scattered  field  satisfying  the  relationship  F * F*  + Fs  , which  is  consistent  with  the  conven- 
tional definition  [Berkowitz,  1965a]  of  the  radar  powsr  cross  section  a and  the  relationship 


o = S • S* 


(38) 


readily  reveals  that  the  range  and  phase  normalized  electric  and  magnetic  scattered  far  fields  Se  and  Sm  are  given 
directly  by  the  fc-space  current  density  distributions 


$e(*s)  = [j'lfAr)  -'££•/(*)]  _ _ 

V4jT 


(39) 


S m(%s)  = — ’ — k X.  J (Ac)  I _ (40) 

V4?  I*  = *s 

where  zu  is  the  impedence  of  free  space.  As  dictated  by  the  transversality  of  the  scattered  far  fields  in  free  space, 
Se.  Sm,  and  ks  are  indeed  all  orthogonal  to  each  other. 

It  can  thus  be  shown  that  the  conventionally  defined  [Berkowitz,  1965b]  electric  polarization  scattering  matrix 
is  given  directly  in  k space  by 


= 


ik0Zo 

V4tr 


% * Hk) 


k = ks 


(41) 


where,  in  the  conventional  notation  for  spherical  coordinates,  are  the  eigenpolarizations  (spherical  coordinate 
unit  base  vectors)  and  Ts  associated  with  the  scattered  far-field  pro;  agation  vector  ks  and  J{k)  is  the  k -space 
current  density  induced  by  an  electric  incident  plane  wave  field  of  the  form  and  polarization 


Ei(x)-^eikrx  (42) 

where  are  the  eigenpolarizations  (spherical  coordinate  unit  base  vectors)  and  'Sj  associated  with  the  incident 
propagation  vector  kf. 

The  solution  to  the  fc-space  formulation  of  the  three-dimensional  scattering  problem-for  the  electric  field 
Eqs.  (32)  and  (34)  for  nonmagnetic  media,  and  the  magnetic  field  Eqs.  (35)  and  (36)  for  perfect  conductors-has 
been  numerically  computer  executed  for  a limited  number  of  cases  by  the  iterative  method  of  solution  of  Eq.  (18), 
with  final  results  within  about  one  decibel  of  exact  known  analytic  closed-form  solutions  after  about  30  iterations. 
Figures  1 through  5 compare  this  technique  with  the  exact  solution  of  Mie  [1908]  for  the  perfectly  conducting 
sphere  (of  radius  a).  The  failure  of  this  Jt-space  technique  ir.  the  near  vicinity  of  k0a  = 2.75  (fig.  1)  is  due  to  the 
fact  that  k0a  = 2.75  is  the  occurrence  of  the  first  eigcnfrequency  (internal  resonance  of  perfectly  spherical  shell) 
[Oshiro  et  al,  1970] . This  difficulty  can  be  readily  and  simply  alleviated,  but  since  the  objective  of  this  project  was 
to  prove  the  feasibility  and  merits  cf  the  k-tpace  method,  and  not  die  generation  of  an  operational  user  library  of 
computer  programs,  this  difficulty  was  taken  as  beyond  the  scope  of  this  project. 
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QUANTUM  SCATTERING 


$ 


Three  dimensional  nonraelivistic  Schrodinger  scattering  at  constant  energy  by  strong  finite  short-ranged  nucleai 
potentials  K(x)  is  considered.  The  time-independent  Schrodinger  equation  in  center-of-mass  coordinates  can  be 
written  in  x space  as 


(V1  + *„J)  Hx)  * -P(i') 


where 


. j - 2 tnE 
k ° =~ 


(43) 


(44) 


Figure  1.  - Power  cross  section  versus  Frequency. 


Figure  2.  - Power  cross  section  versus  aspect  angle; 
M * 1.7;  p,  « 0 


The  Lutial-vaiue  integral  representation  [Rod berg  and 
Thaler,  1967]  of  which  is 


^(x)  + “ f X.xlx)  v(x)  \p(x)  d*x'  + \li  ‘(x)  (46) 

h JD 


Consistent  with  Eqs.  ( J 3)  and  (26),  the  appropriate 
Jfc-space  Green’s  function  is  Cius  given  by 


gik0 1 x I 
4n  1*  I 


(47) 


Figure  S.  - Power  crou  section  versus  aspect  angle; 


ard,  consistent  with  Eqs.  (13)  and  (14),  the  Jk-space  ’^presentation  of  the  initial-value  Schrodinger  field  equation  is 

• *(k)p(k)  + ♦/(Jk)  (48) 

subject  to  the  x-space  constitutive  equation 

p(x)  « - ~”-v(x)  *(x)  (<7) 

Consistent  with  the  inequality  (1^);  the  unique  existence  of  this  fc-space  representation  is  restricted  to  poten- 
tials satisfying  die  relationship 


S t K(x)  I d*x  < - (SO) 

and  the  iterative  numerical  solution  (18)  is  applicable  if  the  potential  F(x)  vanishes  at  a finite  distance.  The  recur- 
sion relationship  for  this  iterative  solution  dearly  is 

Pn(*)  * F(klx)  p„(*j 
*„(*)  - *(*)  p„(fc)  + *'(*) 

+n(x)  - F(xlk)  *„(*) 

2m  — 

Pj,+)(x)  ■ + (!-«*)  P*(*)  (51> 

Consistent  with  Eqs.  (37)  through  (40),  die  ran  go-  and  phaae-nonnalixed  scattered  far-fWd  wave  functions 
S(Jkj)  are  yielded  directly  in  Jk-space  as 
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1 


(52) 


S(k)  = p(*)  _ 

V4n  = 


and  the  differential  and  total  scattering  cross  sections  Oj  and  of  are 


od(ks)  = S(fcj)  • S*(*j> 


■ *0^  £ 


• da* 


where  is  the  area  of  the  spherical  surface  in  k space  of  radius  k0. 

If  the  potential  is  a nonlocal  potential  representable  by  a difference  kernel  - for  example, 


2m  f 

t?  J 


v^x-x1)  ip(x')  d3x' 


the  constitutive  Eq.  (49)  becomes  a local  equation  in  k space: 


P(*)  = ~~  V<k)  nk) 


and  the  scattering  problem  represented  now  by  Eqs.  (48)  and  (56)  can  be  solved  ir.  closed  form  in  k space: 


*(*)  = ?(*)  *(*)  •(*)  + *'(*:) 


*(*)  - 


**(k) 

2 vtt 

l+-g-v(k)*(k) 
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DISCUSSION 


Unidentified  speaker:  What  do  you  mean  by  the  "fait"  matrix  form?  You  mean  the  fact  that  you  can  do  it  quickly? 

Bofanki:  Well,  that  is  a good  way  of  describing  it.  It  is  basically  a Fourier  transform.  If  you  write  Eq.  (10)  and  if 

you  compute  it  numerically  and  it  were  to  go  straightforward  it  would  take  you  n 1 operations  to  obtain  n 
values.  But  if  you  cleverly  use  the  periodicity,  which  was  done  at  the  turn  of  the  century  but  hat  only  been 
programmed  recently,  you  will  And  out  that  you  don't  have  to  compute  n1  operations  but  only  n logn . By 
the  way,  we  did  do  it  in  three  dimensions,  to  you  can  do  three-dimensional  problems  and  larger. 

Kay:  I think  that  this  is  a very  clever  use  of  the  fast  Fourier  transform,  and  1 think  maybe  something  could  be  done, 

but  I can  see  a fundamental  difficulty.  Your  second  equation  is  an  integral  equation  which  is  a convolution, 
and  therefore  could  theoretically  he  inverted 

Bofanki:  Let  me  rewrite  it  so  we  will  know  what  we  are  talking  about.  The  first  one  is  Eq.  ( 1 3)  and  the  second  one 

isEq.  (14). 

Kay:  Okay.  Now,  in  the  first  one,  /(£)  is  the  Fourier  transform  of  the  product  of  two  functions  of  x . Write  it 

out  in  terms  of  the  unknown  function  o(x)  as  a convolution,  an  integral  equation  of  the  convolution  form 
that  can  be  inverted  by  the  Fourier  transform. 

Bofanki:  Yes,  but  that  is  precisely  the  thing  to  avoid,  to  put  it  in  the  convolution  form. 

Kay:  I understand  that,  but  I am  just  commenting  that  die  success  of  die  fast  Fourier  transform  seems  to  depend 

on  this  fact,  and  the  problem  with  the  Gelfand-Levitan  methods  in  the  inverse  problem  is  that  the  kernel  of 
the  integral  equation  has  the  righ  t form  but  it  isn’t  one  that  can  be  inverted  by  the  Fourier  transform,  so  it  is 
not  quite  clear  that  this  technique  would  work  in  that  case,  for  example. 

Bofanki:  Perhaps  we  shouldn’t  go  to  that  but  go  back  to  the  original  differential  equations.  I needn’t  have  written 

it  for  this  operator.  In  fact,  for  the  electromagnetic  case  it  is  of  the  form  del  cross  del  cross,  which  you  put  in 
other  forms.  Basically  this  technique  will  work  so  long  as  you  split  apart  a physical  law  that  has  a differential 
operator  on  a field,  giving  you  a source,  and  a constitutive  equation  that  says  the  source  is  related  to  the  field. 

As  long  as  this  is  a linear  differential  equation  the  technique  will.  work. 

Kay:  Yes,  but  it  does  depend  on  using  a Fourier  transform,  so  this  means  that  the  equivalent  integral  equation  has 

to  be  one  that  can  be  inverted  by  the  Fourier  transform.  I am  not  posing  this  as  a fundamental  objection.  It 
is  just  that  1 can  see  a difficulty  arising  because  of  that  question. 
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INVERSION  OF  RADIATION  DATA  IN  BIOPHYSICS 


Victor  Twcrsky 
University  of  Illinois 


I N73-H624 


This  summarizes  the  essentials  of  my  informal  supplementary  talk  during  the  panel  discussion  (final  workshop 
session)  and  incorporates  some  remarks  relevant  to  comments  from  the  floor. 

There  are  essentially  two  sets  of  topics  in  biophysics  that  are  appropriate.  The  first  relates  to  how  we  acquire 
information  about  our  distant  environment  through  seeing,  hearing,  and  so  forth.  The  second  relates  to  how  we 
use  electromagnetic,  acoustic,  or  other  radiation  for  diagnostic  purposes,  either  at  a bulk  or  a molecular  level. 

The  first  set  includes  inversion  aspects  not  covered  by  this  conference,  in  that  the  mapping  characteristics  of 
the  receiver  (the  sensor-perceptory  system)  are  as  important  as  the  scatterers.  An  elegant  example,  Luneburg’s  [1948, 
1950)  theory  of  binocular  vision  [Blank,  1953, 1961]  was  motivated  by  work  on  stereoptical  and  optical  localization 
equipment  (such  as  long-arm  binoculars  to  widen  the  base  for  triangulation).  Luneburg  was  able  to  account  foi  various 
results  in  the  literature  of  experimental  psychology,  particularly  for  the  noncoincidence  of  parallel  alley  and  equidis- 
tant alley  test  data,  by  a model  based  on  non-Eudidean  geometry:  under  restricted  conditions,  the  subject  exposed 
to  optical  stimuli  in  euclidean  three-space  maps  them  into  a perceptual  three-space  of  constant  negative  curvature. 

As  another  example,  consider  the  development  of  acoustic  guidance  devices  to  aid  the  blind  in  foot  travel 
[Twenky,  1947, 1951a,h] , such  as  hand-held  higi-frequency  sound  sources  to  supplement  or  improve  on  ambient 
noise  (or  heel  clicks,  etc.)  for  obstacle  avoidance  by  hearing.  The  average  increase  in  scattered  sound  intensity  at 
the  ear  as  an  obstade  is  approached  provides  two  kinds  of  auditory  cues:  because  of  the  nonlinear  behavior  of  the 
hearing  mechanism,  an  increase  in  intensity  of  the  high-frequency  stimulus  is  perceived  as  an  increase  in  pitch  as  well 
as  loudness. 

The  second  set  of  relevant  topics,  radiation  diagnostics,  involves  the  same  fundamental  physical  processes  that 
motivated  the  present  conference.  Thus,  the  mathematical  developments  that  have  been  presented  are  as  significant 
to  scattering  phenomena  in  biophysics  as  in  geophysics,  quantum  physics,  etc.  The  inversion  of  x-ray,  optic,  or  ultra- 
sonic scattering  and  propagation  data  to  determine  the  density  profile  of  tumors  and  of  tissue  layers,  the  shape  of 
foreign  objects  in  the  body,  the  velocity  distributions  in  vital  fluid  systems,  and  the  like  [Medical Physics,  1944, 1950, 
1960]  provide  full  analogs  of  the  data  inversion  problems  already  considered. 

In  addition,  there  are  various  many-obstacle  problems,  periodic  arrays,  or  statistical  distributions,  that  are  of 
interest.  Although  a many-obstacle  configuration  can  be  thought  of  as  one  scatterer  in  disjoint  pieces  (so  that  the 
analytical  procedures  already  discussed  are  applicable),  it  is  usually  more  efficient  to  exploit  the  known  results  for 
the  components  as  isolated  scatterers.  We  represent  the  many-obstacle  solution  (for  nonpenetrating  scattering  poten- 
tials) as  a functional  of  isolated-scatterer  functions,  and  concentrate  on  the  novel  aspects  arising  from  the  configura- 
tion [Twenky,  1952;  1962a,i> ; 1967 a;Burke  et  al.,  1965]  of  scatterer  locations,  or  their  statistical  distribution 
[Foldy,  1945, 195!,  1952; Keller,  1964;  Twenky.  1959, 1962c, d,  1964,  1?69]. 

A well-known  example  of  a biologically  significant  config  ration  discovered  in  part  by  i;  'riing  x-ray  scattering 
data  [Langridge  et  al.,  1960]  is  the  double  helix  of  DNA.  (The  ' ' part”  is  stressed  because  muc*»  of  the  information 
used  by  Watson  and  Crick  come  from  direct  chemical  analy-ir  • b i ratios  of  the  four  bases  that  distinguish  the 
nucleotides,  from  electrometric  titration,  from  viscosity  studies,  and  so  forth;  but  one  does  not  play  a gentleman’s 
game  with  nature ) The  x-ray  data  are  obtained  from  measurements  on  a relatively  thick  fiber  consisting  of  billions 
of  aligned  DNA  molecules.  In  the  plane  perpendicular  to  die  fiber’s  axis,  the  distribution  of  the  helices  should  be 
essentially  that  of  a two-dimensional  liquid;  thus,  the  fiber  data  may  show  additional  structure,  analogous  to  that 
shown  by  x-ray  data  for  dense  gases  and  liquids  [Guinier,  1963] , from  which  die  two-dimensional  pair-distribution 
function  could  be  obtained  by  inversion. 
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A single-scattering  approximation  for  the  nucleotides  suffices  for  x-ray  studies  on  DNA.  However,  for  ultra- 
violet studies,  the  interactions  of  the  fields  of  the  nucleotides  must  be  taken  into  account.  As  discussed  by  Weissbluth 
[1971 ) , the  ultra viole*  attenuation  is  smaller  for  a solution  of  DNA  helices  than  for  a solution  of  random  coils  (as 
obtained,  for  example,  by  heating),  because  the  applied  field  couples  die  oscillator  strengths  (electric  dipoles)  of 
neighboring  nucleotides  in  a helix. 

As  a final  example  we  consider  the  inversion  of  optical  data  to  determine  the  oxygen  saturation  R of  blood: 
the  ratio  R of  oxygenated  to  total  hemoglobin  is  one  of  the  principal  parameters  of  interest  in  clinical  medicine. 

If  a sample  of  blood  is  hemolyzed  - that  is,  the  hemoglobir  is  released  from  the  red  blood  cells  - then  R for  the 
hemoglobin  solution  can  be  determined  by  optical  absorption  measurements  at  two  different  wavelengths.  (This  is 
essentially  a refined  version  of  observing  the  color  of  blood,  bright  red  for  high  R as  in  arterial  blood,  and  dark 
red  for  low  R as  in  venous  blood.)  To  avoid  sampling,  optical  instruments  (transmission  oximeters  for  the  shell  of  the 
ear,  reflection  oximeters  for  the  forehead,  fiber-optical  catheters  for  insertion  in  blood  vessels)  and  empirical  calibra- 
tion procedures  have  been  developed  for  continuously  monitoring  R during  anesthesia,  spaceflights,  etc.  To  design 
more  versatile  instruments,  the  inversion  procedure  [Loewinger  et  al„  1 969 ; A nderson  and  Sekelj,  1967a,6]  must 
take  account  of  the  scattering  effects  [7Werrit>\  1970]  of  the  large  red  blood  cells  in  which  the  hemoglobin  molecules 
are  packaged.  For  transmission  oximetry  on  suspensions  of  whole  red  blood  cells,  the  main  beam  is  attenuated  by 
scattering  to  other  directions  as  well  as  by  molecular  absorption;  in  addition,  there  are  the  enrichment  effects  of 
scattering  into  the  beam.  For  a reflection  system,  incoherent  scattering  interface  effects  are  also  significant 
[Twersky,  1970]. 

In  connection  with  comments  cm  measuring  phase  as  well  as  amplitude,  it  should  be  nc'.ed  that  various  para- 
meters sought  by  inverting  the  total  average  intensity  and  coherent  phase  shift  could  be  obtained  more  directly  from 
averages  of  several  products  of  the  phase  quadrature  components  of  the  field  [Twenky,  1963, 1967  b . Beard  etal., 

1 967 ; Hawley  et  aL,  1967] . The  components  may  be  measured  with  a coherent  source  and  a double  interferometer 
analogous  to  the  double-bridge  heterodyne  system  developed  for  millimeter  wave  measurements  on  scattering  by 
large-scale  random  distributions  [7Wenjty,  1963, 1961  b\ Beard  et  aL,  1967 ; Hawley  et  aL,  1967].  Such  systems 
enable  us  to  separate  die  coherent  and  incoherent  intensities  even  in  the  forward  direction,  to  measure  new  combi- 
nations of  the  fundamental  parameters,  and  to  seek  data  in  which  the  many-particle  distribution  functions  enter  in 
first  order  instead  of  as  small  corrections. 
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TRANSMISSION  LINE  INVERSION  AND  SYNTHESIS 
FROM  THE  POINT  OF  VIEW  OF  TRANSIENT  RESPONSE 


M.  M.  Sondhi 


Bell  Telephone  Laboratories,  Inc.,  Murray  Hill,  New  Jersey 


The  work  described  here  is  a joint  effort  of  Sondhi  and  Gopimth  [ 1 97 1 a,b ; Gopinath  ana  Sondhi,  1971]. 

Although  the  problem  is  discussed  as  one  of  transmission  line  inversion  (or  synthesis),  the  motivation  for  study- 
ing it  comes  from  research  in  speech  communication.  Since  this  field  of  research  is  rather  remote  from  the  other 
disciplines  represented  here,  we  first  briefly  consider  the  problem  in  its  original  form  and  then  reduce  it  to  one  of 
transmission  lines,  indicating  the  most  recent  results  obtained. 

Figure  1 is  a cross  section  of  the  human  vocal  tract.  To  produce  speech  sounds  we  acoustically  excite  this  tract 
by  means  of  quasi-periodic  pulses  from  the  vocal  cords  during  vowel  sounds  and  turbulent  air  flow  at  various  points 
along  the  tract  during  fricative  sounds  such  as  s,  sh,  and  f.  If  we  assume  plane  wave  propagation  (which  is  a good 
approximation  for  propagation  of  waveforms  limited  to  about  3500  Hz),  then  the  cross-sectional  area  C(x),  the 
pressure  p(x,  t ),  and  the  volume  velocity  u(x,  t ) , which  is  defined  as  C(x)  times  the  particle  velocity,  are  related 
through  the  differential  equations 
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where  the  units  are  chosen  so  that  the  density  of  air  and 
the  velocity  of  sound  are  both  unity.  We  identify  x = 0 
with  the  lips  and  make  the  further  normalization 
C(0)  = 1 - that  is,  the  lip  opening  has  unit  area.  These 
three  normalizations  completely  specify  a mass-length- 
time system  of  units. 

Now  if  the  shape  of  the  tract  is  known  - that  is, 
in  our  idealization  to  plane  waves,  C(x)  is  known  - 
and  the  excitation  is  known,  we  can  compute  the  pres- 
sure at  the  lips  - the  speech  wave.  This  computation 
involves  solution  of  Eqs.  (1)  for  given  boundary  conditions. 


Tongue 


Vocal  Cords 
(Glottis) 


Figure  1. 


The  inverse  problem  of  interest  here  is  to  infer  C(x) 

from  acoustic  measurements  at  the  lips  That  is  the  sort  of  information  one  requires  for  synthesis  of  speech,  making 
models  of  the  speech  production  mechanism,  and  so  forth. 

The  problem  becomes  a transmission  line  problem  if  we  replace  the  pressure  p(x,  t ) by  the  voltage  v{x,  t ) 
and  the  volume  velocity  u(x,  t ) by  the  current  i(x,  t).  With  these  identifications  Eqs.  (1)  become 
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which  are  the  equations  discussed  by  Kay  (Chap.  6)  except  that  he  used  the  steady-state,  frequency  domain  version 
with  d/d t replaced  by  iw.  in  fact,  Eqs.  (2)  are  the  equations  for  a special  type  of  line  for  which  C(x)  is  the 
capacitance  per  unit  length  and  also  die  reciprocal  of  the  inductance  per  unit  length.  This  reciprocal  relation  between 
capacitance  and  inductance  eliminates  the  inherent  ambiguity  of  the  general  transmission  line  inversion  problem. 
Recall  the  fact  brought  out  by  Kay  that  input  measurements  alone  cannot  yield  both  the  inductance  and  the  capaci- 
tance of  an  arbitrary  line.  Only  one  function  derived  from  these  quantities  can  be  reconstructed.  This  function  may 
be  chosen  as,  for  example,  the  characteristic  impedance  as  a function  of  electrical  distance  or  some  other  equivalent 
form.  In  our  ca«,  because  of  the  reciprocal  relation  the  function  to  be  reconstructed  may  be  chosen  to  be  C(x) 
itself. 


The  end  x - 0 of  this  transmission  line  is  accessible  to  us  for  measurement.  As  the  Eqs.  (2)  are  linear  it  is  clear 
that  if  we  can  measure  the  impulse  response  at  the  input  then  we  have  measured  all  we  possible  can.  All  other  measure- 
ments can  be  predicted  from  the  impulse  response  by  superposition.  The  impulse  response  is  defined  here  to  be  the 
voltage  h(t)  developed  at  x = 0,  t units  of  time  after  a unit  impulse  of  current  is  applied  to  the  quiescent  line  at 
x = 0.  It  can  be  shown  that  h(t)  = 5(f)  + h(t)  where  h(t ) is  continuous  if  C(x)  is  continuously  differentiable. 

(We  assume  this  degree  of  smoothness  for  C(x),  although  this  is  not  necessary.) 

With  this  definition  of  the  impulse  response  if  the  transmission  line  is  quiescent  at  t = t0  -that  is, 
v(x,  t0)  = i(x,  tQ)  = 0 for  all  x > 0 - then  the  input  voltage  and  current  are  related  by  the  convolution 


v(0,f)  * i(0,f)+  J h(t-r)  i(0,r)  dr 
fo 


Onr  problem  may  now  be  stated  precisely  as  follows:  We  are  given  a transmission  line,  described  by  Eqs.  (2), 
with  C(x)  continuously  differentiable  and  0<C(x)<°°.  At  x = 0 we  measure  h(t)  for  the  interval  0<t<T. 
Can  this  measurement  tell  us  anything  about  C(x)  ? 

Since  the  velocity  of  propagation  is  unity,  it  is  clear  that  h(t)  for  t < T is  unaffected  by  the  properties  of  the 
line  for  x > T/2.  Those  remote  regions  of  the  line  just  cannot  be  probed  in  so  short  a time  from  the  input.  Thus 
h(t)  for  t<T  tells  us  nothing  about  C(x)  for  x>TI  2.  The  remarkable  thing,  however,  is  that  it  gives  C(x) 
uniquely  for  x < T/2. 

The  basic  result  is  as  follows:  With  h(t)  given  for  r<T  choose  any  «<r/2;  solve  the  integral  equation 


m +“■  J '*’(  Ir-T  l)Ar)dr  - 1 IfKa  (3) 

2 •'—a 


for  the  unknown  function  /(r).  Then 
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(4) 


0 

no  ^ = 


no  it  = 


C(x)  4x 


which  solves  the  problem. 

The  argument  leading  to  this  result  is  simple  and  is  based  on  the  notions  of  causality  and  charge  conservation. 
Suppose  the  line  is  quiescent  at  t--a,  at  which  time  a current  i(0,  t ) starts  to  flow  in  at  x * 0.  Causality  and 
unit  propagation  velocity  then  assure  us  that  at  f = 0 the  line  will  be  undisturbed  for  x > a.  Thus,  the  charge  de- 
livered by  the  input  current  must  appear  somewhere  on  the  portion  x < a of  the  line.  From  charge  conservation, 
therefore. 


What  one  can  show  is  that  if  i(o,  r) 
Thus,  we  obtain  Eq.  (4). 


/ i(o,  f)df  = / v(x,  o)C(x)dx 

J-a  Jo 


(5) 


» /(f),  where  f(t)  is  the  solution  of  Eq.  (3)  ther  v(x,  o)  = 1 for  all  x < a. 


To  obtain  C(x)  one  has  to  differentiate  the  integrated  capacitance  obtained  from  Eq.  (4).  For  numerical 
computation  this  is  a very  undesirable  operation.  It  turns  out  to  be  unnecessary,  for  a simple  manipulation  of  Eqs. 
(3)  and  (4)  shows  that 


/»  - C(o) 


(6) 


Note,  however,  that  differentiation  of  Eq.  (4)  with  respect  to  a does  not  give  /(a)  = C(a)  because  /(f) 
in  that  equation  depends  on  a implicitly. 

The  impulse  response  is  a positive  definite  operator;  mathematically,  this  means  that  the  inequality 

/•a  -a  -a 

/ g1  (f)  it  J J £(lf-Tl)*(f)*(T)<if  >0 

'-a  "4-ot  oi 

holds  for  any  square  integrable  function  g(t)  on  the  interval  I f 1 < a < 772.  Physically,  this  inequality  states  that 
an  input  current  g(t)  delivers  a positive  amount  of  energy  to  the  line  because  of  die  passitivity  of  the  line  and 
because  C(x)  # 0 or  «-» . This  positive  definite  property  ensures  the  existence  and  uniqueness  of  the  solution  of 
Eq.(3). 

In  conclusion  it  should  be  noted  that  if  h(t)  is  specified  a priori  rather  than  measured,  we  have  a synthesis 
problem.  The  theory,  of  course,  applies  equally  well  to  this  case  provided  the  specified  function  is  a valid  impulse 
response.  Clearly  h(t)  must  satisfy  the  two  necessary  conditions  discussed  earlier  - that  is,  it  must  be  positive 
definite  and  of  the  form  6(f)  + h(f).  We  can  show  that  these  conditions  are  also  sufficient  to  construct  C(x). 

At  least  f continuous  h(t)  and  continuously  differentiable  C(x)  the  theory  is  complete  in  the  sense  in  which 
Sabatier  used  the  term  (Chap.  S).  That  is  to  say  the  relation  h (f)  «■  C(x)  is  unique  in  each  direction. 
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DISCUSSION 


Moses:  I want  to  say  I am  very  much  impressed  with  this  paper,  and  I will  tell  you  some  reasons  why.  First  of  all, 

I want  to  correct  an  impression.  The  method  of  Gelfand-Levitan,  the  way  Irv  Kay  does  it  does  hold  for  time, 
so  it  can  be  put  in  the  time-dependent  form.  You  can  send  in  a time-dependent  impulse  and  get  something 
out.  There  is  no  difficulty  whatever.  In  fact,  he  presented  it  in  the  time-dependent  form,  actually,  when  he 
did  it  through  causality,  with  a 6 function  and  so  on.  He  immediately  transformed,  but  it  is  completely  un- 
necessary to  transform. 

Sondhi:  Yes,  but  the  viewpoint  has  always  been  in  terms  of  steady-state  behavior  as  far  as  I know. 

Moses:  You  don’t  have  to  do  it  that  way,  though,  with  Gelfand-Levitan. 

Sondhi : Perhaps  not.  Only,  they  have  never  pursued  it  this  way. 

Moses:  Well,  in  Irv’s  paper,  it  will  be  this  way.  lie  has  published  elsewhere,  as  a matter  of  fact.  I have  to  give  credit 

to  Irv  for  recognizing  many  years  ago  that  the  Gelfand-Levitan  equation  was  an  aspect  of  causality , just  as  this 
is.  Now,  you  seem  to  have  a different  aspect  of  causality.  This  is  what  makes  it  very  interesting. 

When  Irv  and  I worked  on  the  three-dimensional  Gelfand-Levitan  theory  we  tried  to  derive  the  causality 
and  got  absolutely  stuck,  and  we  couldn’t  do  it.  We  had  to  go  to  the  spectral  theory.  But  I think  causality  is 
the  proper  way  to  do  all  these  problems. 

Sondh1:  Well,  I would  like  to  make  one  additional  point  as  far  as  causality  is  concerned.  For  the  specific  problem 

that  we  were  interested  in,  if  you  use  the  frequency  approach,  you  must  know  first  of  all  the  end  condition  at 
the  glottis.  Now,  this  end  condition  is  never  really  known.  You  could  assume  that  the  tube  is  closed  at  that 
end,  but  this  assumption  would  not  always  be  valid. 

Our  method  doesn’t  require  any  information  about  that.  It  builds  up  the  cross-sectional  area  function 
starting  from  the  end  at  which  you  are  making  measurements,  far  uown  as  you  can  go,  depending  on  how 
many  seconds  of  the  impulse  response  you  have. 

Parry:  For  this  particular  application,  wouldn’t  it  be  more  natural  to  start  the  signal  at  one  end  and  observe  it  at 

the  other  end? 

Sondhi:  But  that  is  what  we  are  trying  to  get  around,  because  we  can’t  put  a known  signal  into  the  glottis.  That’s 

the  problem,  ideally,  what  you  would  like  to  do  is  to  get  the  area  function  directly  from  the  speech,  but  you 
don’t  know  what  the  excitation  is.  You  can  only  make  some  reasonable  assumptions  about  it.  However,  that 
might  produce  a large  error  in  the  reconstructed  area. 

Now  the  measurement  problem  is  not  easy  because  you  have  to  ask  the  person  to  mouth  the  words  vitli- 
out  actually  speaking.  While  you  are  making  the  measurement  you  don’t  want  the  speech  to  interfere  with  it. 

Many  linguists  don’t  like  this,  because  they  say  that  you  are  not  measuring  the  shapes  you  would  produce 
in  normal  circumstances.  How  important  this  effect  is  r»o  one  really  knows.  However,  in  measuring  the  2rea 
function  no  assumptions  are  involved  about  the  excitation  and  the  boundary  conditions. 

Parry:  As  I understood  it,  you  assumed  that  the  cross-sectional  area  at  x * 0 was  1?  So  your  results,  then,  will  be 

given  in  terms  of  a ratio  of  C(x)  to  C(0)?  Anyone  who  has  a big  mouth,  so  to  speak,  would  therefore  upset 
your  calculations,  or,  being  more  serious  about  it,  the  point  is  that  die  C is  not  constant  at  x * 0. 
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Sondhi:  Right.  We  scale  down  big  mouths  in  the  following  way.  To  make  this  measurement  it  is  not  feasible  to 

just  put  a pulse  of  sound  at  the  lipe  and  make  a pressure  measurement  right  there.  Whit  we  do  is  we  have 
what  is  known  as  an  impedance  tube,  into  which  we  put  some  energy,  and  the  person  holds  his  mouth  at  one 
end  by  a flexible  coupling.  We  start  reconstructing  the  area  from  some  point  in  the  tube  where  we  know  what 
the  area  is.  That  is  the  way  we  get  around  the  problem.  But  in  general  you  are  right.  If  you  were  going  to 
nrc>ke  measurements  without  such  a tube,  then  the  scale  factor  would  have  to  be  restored  by  other  means. 

Portinari:  Did  you  look  into  conditions  of  h such  that  C(x)  is  of  finite  range,  i.e.t  is  constant  after  a finite  length? 

Sondhi:  You  mean  that  C(x)  should  have  some  variation  and  then  stay  constant? 

Portinari:  That’s  right. 

Sondhi:  I haven’t  looked  at  that  problem,  but  if  that  were  the  case,  you  see,  it  woi id  be  easier  to  look  at  it  as  a 

termination  of  the  transmisiion  line,  and  solve  the  problem  that  way.  The  uniform  tube  is  equivalent  to  a 
resistance  termination  on  a transmission  line,  so  1 could  look  at  this  variable  transmission  line  with  a resistor 
termination.  I am  not  sure  I understood  your  problem  correctly. 

Portinari:  What  1 am  asking  is  more  of  a synthesis  question  than  diagnosis.  I mean,  you  have  h.  You  want  to 

construct  C(x),  and  you  would  be  interested  in  havmg  something  that  occupies  a finite  length.  What  would 
be  conditions  on  h to  guarantee  that? 

Sondhi:  I haw.  not  looked  at  that  problem.  We  have  looked  a little  bit  at  transmission  lines  that  include  distributed 

resistive  and  conductive  losses.  This  is  the  question  I asked  Professor  Kay,  in  fact.  What  happen:  if  you  know 
the  resistance  and  conductance?  Can  you  get  C from  such  measurements? 

We  don’t  know  the  answer  in  general,  but  in  some  special  cases  we  nave  activity  solved  that  problem, 
also  1 have  never  seen  anybody  treat  die  inverse  problem  including  loss  in  transmission  lines  before,  even  in 
special  caws 'Sondhi  and  Gopinath,  1971b]. 

Mittra:  1 wonder  if  Dr.  Moses  and  you  would  comment  on  the  correspondence  between  the  time  domair.  Cellar,  <f- 

Levitan  equation  and  this  one? 

Moses:  I haven’t  studied  this  equation  long  enough,  realty,  to  make  a contrast.  With  the  Geiland-Levitan,  when 
you  do  it  in  the  time  domain,  in  which  you  start  at  f * 0,  you  assume  a time  cut-off.  Your  line  doesn’t  go  to 
infinity.  I’ll  do  it  in  the  Schrodinger  equation. 

You  start  at  f»0.  You  send  the  signal  in.  It  bounces  back  from  the  reflection  coefficient,  or  the  con- 
volution of  the  reflection  coefficient,  whatever  wave  you  get  back.  Youcangetthereflectioncoeffkient,andyeu 
carry  out  the  Geiland-Levitan.  There’s  a finite  time  for  this  packet  to  coma  out.  The  cutoff  of  the  potential 
permits  you  to  send  something  e’se.  I assume  the  same  thing  happens  here 

Mittra:  He  it  me  king  distinction.*.  t<  he  never  has  to  worry  about  the  terminations.  He  is  steady  on  the  lino.  He  is 

just  generating  the  assumptions 

Shmoys:  It  i*  in  infinite  lin*,  because  he  considers  transient  response  for  a finite  time  lag. 

Sondhi:  Wba  t Professor  Shmoys  is  saying  is  righ*.  In  T units  of  time  you  can  only  traverse  T/2  units  of  the  line; 
whethir  it  is  infinite  from  there  on  or  not  is  immaterial 

But  1 don’t  realty  know  what  is  meant  by  the  time  domain  Geiland-Levitan  equation  because  I have  never 
seen  i . I would  like  to. 
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Moses:  I think  causality  is  the  essential  feature.  I think  spectral  theory  may  be  a way  of  solving  it,  but  the  causality 

makes  it  very  interesting. 

Unidentified  speaker:  Harry,  you  should  make  that  clear,  it  is  the  Marchenko  version  that  you  have  in  mind. 

Moses:  1 was  thinking,  actually,  of  the  Gelland-Levitan,  with  0 to  °°,  I mean  ±«°.  You  just  cut  off  the  potential 

at  0.  Marchenko  is  very  close  to  that. 
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ABSTRACT 


Work  is  being  earned  out  at  the  Naval  Electronics  Laboratory  Center  to  determine  D-region  electron-density 
distributions  from  VLF-LF  sounder  data.  The  direct  problem  is  to  compute  reflection  coefficients  for  an  assumed 
electron-density  profile,  using  a full-wave  solution.  The  inverse  problem  is  treated  numerically,  using  derivatives  of 
the  reflection  matrix  with  respect  to  model  parameters.  A technique  for  the  rapid  computation  of  the  derivatives 
is  described.  An  algorithm  for  determining  resolution  in  the  profile  is  discussed. 


INTRODUCTION 


Our  work  at  the  Naval  Electronics  Laboratory  Center  is  that  of  finding  D-region  electron-density  distribu- 
tions 1.0m  very  low  frequency  reflection  data.  It  is  an  inverse  problem,  since  the  direct  problem  is  that  of  computing 
reflection  coefficients  when  the  electrcr.  density  profile  is  known.  The  direct  problem  involves  the  numerical  solu- 
tion of  the  full-wave  problem  for  a lossy  anisotropic  ionosphere,  since,  in  the  VLF  region  of  the  radio  spectrum,  ray 
theory  no  longer  applies.  Data  were  first  taken  in  1961 . In  an  effort  to  perform  the  inverse  problem  of  finding  elec- 
tron density  profiles  from  the  data,  v,  - first  attempted  triai-and  error  methods,  rather  unsuccessfully. 

Function  optimization  techniques  were  later  used  for  Ending  these  distributions.  Sheddy  [1963a]  has  discussed 
this  subject,  and  an  error  analysis  for  relating  the  uncertainties  in  the  data  to  uncertainties  in  the  profile  subsequently 
was  given  by  Shellman  [1969, 1970] . 

Not  many  sets  of  data  were  analyzed,  however,  because  of  the  large  amc  int  of  computation  required.  The 
problem  can  be  appreciated  a little  more  by  mentioning  that  it  takes  about  1 or  2 sec  on  an  IBM  360-65  computer 
to  compute  the  fuli-wave  solution  for  one  frequency.  Data  may  be  taken  at  a*  many  as  ten  frequencies,  so  carrying 
out  a complete  set  of  full-wave  solutions  for  very  many  trial  profiles  could  run  up  a computer  bill  very  quickly. 

About  two  months  ago  we  discovered  a rapid  way  of  differentiating  the  numerical  integration  with  respect  to 
electron-density  model  parameters,  which  has  speeded  up  our  computation  considerably.  The  method  is  discussed 
a bit  later. 

This  meeting  has  certainly  demonstrated  the  importance  of  interdisciplinary  communication,  as  we  have  seen 
parallels  with  our  efforts  at  NELC,  especially  in  the  presentations  given  by  the  geophysicists.  Doubtless  our  work 
with  the  VLF  sounder  data  could  have  been  much  easier  had  we  known  of  these  other  efforts  sooner. 

The  region  of  the  ionosphere  that  we  are  interested  in,  namely,  the  D-region,  is  inaccessible  to  HF  sounde  s. 

The  densities  are  generally  below  104  electrons  per  cm* , and  the  region  was  generally  denoted  by  a dotted  line  at 
the  bottom  of  the  profile  in  the  talks  on  HF  sounders. 
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The  NELC  VLF  sounder  [Paulson  and  Thnisen,  1962]  is  located  at  Sentinel,  Arizona,  in  a region  of  lava  flow, 
and  hence  low  electrical  conductivity.  The  transmitter  (dipole)  antenna  is  oriented  along  the  magnetic  north-south 
direction,  and  is  about  9 km  long.  The  receiver  vans  are  usually  placed  about  IS  km  magnetic  west  of  the  trans- 
mitter. Two  receiver  loops,  oriented  perpendicular  to  one  another,  are  used.  One  receives  the  normally  reflected 
skywave  and  the  other  receives  the  conversion  component,  which  is  present  only  because  of  anisotropy  -the  influence 
of  the  earth’s  magnetic  field.  The  two  components  are  usually  of  the  same  order  of  magnitude. 

Reflection  from  the  ionosphere  is  described  by  the  reflection  matrix 


R - 


l»*«lexp  (<«*») 

l|*il  expO'n$i) 


li^ill  exp  (f^i) 
liflil  exp  (ii0i) 


where  the  first  subscript  refers  to  the  polarization  of  the  upgoing  wave  with  respect  to  the  plane  of  incidence  and  the 
second  subscript  refers  to  the  downcoming  wave.  Because  of  the  orientation  of  transmitter  and  receiver,  the  right  half 
of  this  matrix  applies  to  the  sounder,  hence  we  have  four  (real)  measurables  for  each  sounder  frequency. 

For  convenience  of  notation,  we  denote  all  the  observables  by  R/  and  assign  to  each  one  an  expected  standard 
error  or.. 

What  we  are  trying  to  accomplish  is  the  following:  We  have  data  from  which  we  want  to  find  an  electron 
density  distribution:  because  the  data  have  uncertainty,  and  we  have  a finite  amount  of  data,  the  density  distribu- 
tion similarly  has  uncertainty.  So  we  have  in  the  profile  a certain  resolution  and  certain  error  width,  the  resolution 
being  related  mostly  to  the  amount  of  data,  and  the  error  band  to  both  the  amount  and  accuracy  of  the  data. 

The  measure  -if  fit  of  any  electron  density  distribu- 
tion t > the  data  is  taken  to  be  the  sum:  of  the  squares  of 
the  deviations  of  the  computed  values  r,-  from  the  data 
values  Ri , where  normalization  i*  made  by  dividing 
through  by  the  standard  errors: 

m / 

• Ri  ) 


Figure  1 shows  the  model  used.  The  electron  density 
distribution  is  specified  by  parameters  <xj  at  a set  of 
heights  hj  in  the  profile.  Logarithmic  interpolation  it 
made  between  the  points,  so  that  there  are  iwo  things 
to  find  here:  first,  the  distribution  of  heights,  which  is 
determined  by  the  type  of  data  available;  and  second, 
given  those  heights,  we  need  fne  values  of  the  parameters 
«/  that  are  the  logs  of  the  electron  densities  at  the 
heights  hj. 

We  consider  the  problems  in  reverse  order,  first  assuming  that  we  know  the  distribution  of  heights  and  treating 
the  problem  of  finding  the  parameters  aj , and  later  considering  the  distribution  of  heights. 


Figure  l.-  Electron  density  modal  for  e ■ 4, 
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THE  DERIVATIVES,  3R /day 


For  both  problems  we  need  derivatives  dr^/day  denoting  the  rjtes  of  change  of  reflection  coefficients  with 
changes  in  the  model  parameters.  The  solution  for  dR/da<,  he  .ee  for  dr, /day,  is  the  problem  we  have  recently 
solved  for  rapid  computation. 

The  full-wave  solution  for  R is  an  integral 


which  is  carried  out  by  Runge-Kutta  integration.  The  expression  for  dR/d/i  is  ven  by  Budden  [195  [ as. 


dR 

d/i 


+ S*,J)  R - RS{“)  - RS(,l)R 


The  initial  condition  for  R is  given  by  rrn  analytic  computation  [Sheddy,  1968b).  The  are  2X2  complex 
it  . ces  that  are  functions  of  the  incidence  angle  and  susceptibility  metrix,  which  in  turn  is  a function  of  the  normal- 
ized plasma,  gyro,  and  collision  frequencies,  X,  Y,  and  Z,  respectively,  and  direction  of  the  magnetic  field. 

The  normalized  electron  density  X depends  in  turn  on  the  model  parameters  ay.  We  want  to  find  derivatives 
of  R with  respect  to  ay,  which  is  the  derr  stive  on  an  integral: 


We  now  take  the  derivative  inside  the  integral: 


At  first  glance  this  does  .A  appear  to  save  us  anything  because  we  still  have  to  do  the  integration.  However,  v c 
know  that  each  parameter  of  the  model  affects  only  a certain  region  of  the  profile  above  which  the  derivativ*  is 
zetu,  so  that  the  integral  is  zero.  Within  tlie  region  affected  by  the  parameter  we  integrate  an  expression  obtained 
by  differentiating  dR/dA  with  respect  to  the  parameter  ay: 

3(dR/dA)  -/k(r8S(,,)  dS^1)  dS*M)  dS*11* 

— — = + R - R R R 

day  2 I day  day  day  da y 


+ 


i*-s(«.)  -i!  s^r 

d*y  day 


R5(is) 
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Figure  X - Three  regions  of  profile  pertinent  to  obtaining 
/tb  derivative,  »R /d«y. 


In  the  region  below  the  rlabs  effected  by  ay  this  equa- 
tion is  linear  ar  d homogeneous  since  the  first  term  in 
square  brackets  is  zero.  This  is  what  saves  us,  because  it 
turns  out  that  integrations  for  linear  homogeneous  dif- 
ferential equations  can  be  performed  without  knowledge 
of  initi.'  conditions.  You  can  insert  the  initial  conditions 
at  the  end. 

Figure  2 indicates  these  three  regions.  In  integrating 
we  do  four  things.  We  perform  the  integration  for  R. 

We  integrate  through  the  pairs  of  slabs  affected  by  the 
parameters  ay  and  store  the  value  of  each  integral 
(dR/da;)yt,  at  the  bottom  of  the  pair  of  slabs.  We 
also  integrate  the  linear,  homogeneous  set  of  equations 
through  each  slab  and  store  the  value  of  the  integral  A y 
at  the  bottom  of  each  slab,  where  Ay  is  such  that 

/ 9RA 


.md  R is  a vector  with  the  same  four  elements  as  the  matrix  R.  We  can  then  work  our  way  back  up  the  profile, 
forming  the  product  of  Ay’s  and  computing 


So  the  time  to  get  the  whole  set  of  derivatives  does  not  really  depend  on  how  finely  the  profile  is  divided.  We 
get  the  whole  set  of  derivatives  in  about  the  same  time,  whether  the  profile  is  divided  into  10  slabs  or  100  or  whatever. 


SEARCH  TECHNIQUE 


Once  we  have  the  derivatives  we  use  a combination  cf  standard  search  (function  optimization)  techniques.  The 
measure  of  fit  t is  to  be  minimized.  We  know  something  about  the  first  and  second  derivatives  of  s : 


day  day 


dry  dry 

day  day 


What  we  know  forms  in  idealized  parabolic  surface  of  n dimensions  (fig.  3),  and  along  that  surface  we  make 
essentially  a Fleicher-K*evcs  search,  t hich  defines  a quasi-steepest  descent  path  down  the  surface. 


S 40 


The  minimum  of  the  real  function  s is  then  sear-jied 
for  along  this  path  by  a Fibonacci  one-dimensional  search, 
which  involves  about  half  a dozen  full-wave  solutions  for 
each  sounder  frequency  and  accounts  for  most  of  the  com- 
putation time. 

Once  the  minimum  is  found  along  this  path  we  have 
completed  an  iteration.  For  a rough  approach  to  the  mini- 
mum about  half  a dozen  iterations  may  be  sufficient. 


RESOLUTION  iN  THE  PROFILE 


MIS 

M ONI.  NTH 


Figure  3.-  Idealized  parabolic  surface,  s(a, 
defined  by  derivatives  3 R /3oty,  j = 1 , ■ 
figure,  n = 2. 


’%>• 

, n.  In  the 


Now  consider  the  problem  of  choosing  the  heights  in 
the  profile.  The  development  of  algorithm  for  choosing 
such  heights  was  begun  fairly  recently.  The  philosophy 

here  is  that  we  do  not  want  heights  hj  in  the  profile  for  which  electron  density  cannot  be  determined  from  the  data. 
We  can't  have  too  many  in,  but  on  the  other  hand  we  want  all  that  are  warranted  by  the  data. 


The  basis  for  choosing  the  heights  hj  is  the  mapping  of  the  uncertainties  in  the  data  pa.ameters  into  an  error 
band  on  the  profile.  Such  a mapping  is  described  in  detail  by  Shellman  [1970] , and  the  mathematics  are  simplified 
in  this  presentation.  The  width  of  the  error  band  on  the  profile  is  determined  both  by  the  uncertainties  in  the  data 
and  by  the  covariance  between  profile  parameters  ay.  The  width  of  the  error  band  at  the  height  hj  is 


°Oy  = 


where  /yy  is  the  ;th  diagonal  element  of  the  inverse  of  the  variance-covariance  matrix.  The  latter  has  elements 


± L III 

I da  j daj 


The  rather  simple  algorithm  for  choosing  the  hj  - determining  the  resolution  - is  then  the  following:  If  at 
three  successive  points  in  the  profile  the  error  band  is  wider  than  a chosen  upper  threshold  value,  the  middle  point 
is  deleted;  and  if  the  error  band  is  narrower  than  a chosen  lower  threshold  value,  points  are  added  on  each  side  of  th? 
point.  The  threshold  values  are  not  critical. 
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AN  EXAMPLE 


Figure  4 shows  the  iteration  steps  in  one  particular  case.  In  this  example  the  variance-covariance  matrix  was 
normalized  by  dividing  each /throw  and  /thcolumnby  [T"l(3?y/3af)a]f/i  1 hence,  the  matrix  was  actually  a 
“correlation  matrix.”  The  principle  is  the  same,  however. 


We  start  out  with  three  (open  circled  joints  in  the  profile.  The  algorithm  for  choosing  heights  then  added  a 
point  at  about  80  km,  and  one  iteration  in  the  numerical  search  led  to  the  profile  with  the  long  dashed  line.  The 
second  iteration  led  to  the  profile  with  the  ~vort  dashes. 


Figure  4.  - First  four  iteration  in  search  for  profile  which  fits 
data  taken  23  March  1971, 1300  MST.  Data  set  consisted 
of  I and  l,R,  1 at  8.3, 13.2,  21.7,  27.5, 


and  33.5  kit  Phase  data,  and 

added  in  subsequent  iterations. 


At  that  point  the  algorithm  added  a point  at 
60  km.  Then  we  go  to  the  dotted  line  in  the  next 
iteration  and  three  points  are  added,  indicated  by 
the  triangles.  The  next  iteration  then  took  us  to  the 
chain  line.  We  carded  out  three  or  four  more  itera- 
tions, but  the  final  profile  was  approximately  the 
form  of  the  last  one  shown. 

In  general,  the  algorithm  for  choosing  the  heights 
of  the  profile  leads  to  about  the  same  number  of  heights 
as  we  have  data  parameters,  which  is  encouraging 
because  that  is  what  we  would  expect;  that  is,  we  get 
the  same  amount  of  information  out  that  we  are  putting 
in. 


It  should  also  be  mentioned  that  the  collision- 
frequency  profile  has  been  treated  in  the  same  manner 
as  the  electron-density  profile.  That  is,  it  has  teen 
assumed  that  the  collision-frequency  profile  is  also  an 
unknown  to  be  found  from  the  sounder  data. 
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THE  JOST-KOHN  INVERSION  PROCEDURE 


Reese  T.  Prosser 
Dartmouth  College 


f N73- 11627 


With  reference  to  my  discussion  of  the  Jost-Kohn  procedure  in  Chapter  6,  there  are  some  conditions 
that  must  be  mposed  on  the  situation  before  reasonable  results  can  be  expected.  These  conditions  are  con- 
sidered briefly  here. 

This  discussion  is  restricted  to  the  quantum-mechanical  problem,  but  now  in  three  space-dimensions,  without 
assuming  any  radial  or  other  symmetry  of  the  potential.  We  consider  the  problem 

VJp  + co2p  = V$ 


In  Chapter  6, 1 suggested  that  we  search  for  a solution  of  the  form 


*(x,  k)  = e*’*  + J*G(x,  x’  ; w)  K(x')  p(x’)  dx’ 

where  V is  the  potential,  G the  associated  Green  function,  k the  incoming  momentum,  and  k'  the  outgoing 
momentum,  with  I kl  = Ik' I = u>.  1 said  that  in  the  far-fleld  limit  this  solution  would  tend  to  something  like 

eflkl  Ixl 

*(x,k)  * e***  + T(k',k)  - — + o 

4ir  M 

that  is,  as  a.i  ingoing  piaiv^  wave  plus  an  outgoing  spherical  wave  weighted  by  a “reactance  matrix”  7(k',  k)  of  the 
form 


r(k',  k)  =/*-*’ -y  K(y)*(y, k) dy 

Earlier  I implied,  but  didn’t  make  explicit,  that  the  scattering  data  of  die  problem  are  all  contained  in  this  reactance 
matrix,  this  far-field  behavior  being  uniformly  good  for  large  Ixl,  and  die  correction  terms  being  o(l/  Ixl). 

The  Jost-Kohn  procedure  assumes  that  we  have  some  information  about  the  reactance  matrix,  from  which  we 
want  to  obtain  the  original  potential  V.  The  situation  is  better  understood  if  we  embed  the  problem  in  a family  of 
nonphysical,  purely  mathematical  problems  defined  by  nonlocal  potentials.  Specifically,  we  let  this  problem  be  a 
member  of  a larger  class  of  problems  in  which  the  potential  has  the  form 


F*  « / K(x,  x')p(x')  dx' 


We  begin  by'  counting  dimensions.  For  a potential  of  this  form,  we  must  find  a ft  nctic  n of  six  variables.  We 
thus  need  a six-parameter  family  of  data,  and  that  is  supplied  by  the  complete  reactance  matrix. 


Proceeding  formally,  we  express  the  reactance  matrix  T in  terms  of  the  potential  matrix  V . Since 
</>-0o  + GV<t>  and  r(k',k)  = < 4>'0  \T\<t>Q)  = ( <f>0  W\<p),  where  <£0  = e‘^'x  and  <p  = <p(x,  k),  we  have 

% \T\<t>0)  = W\<t>0)  + (<t>'0  WGT  l*> 


or,  in  operator  form. 


T = V + VGT 


Thus 


T = ( I-VG)~l  V 

= V + VGV  + VGVGV  + • • • 


This  argument  gjves  us  a formal  series,  the  Bom  series,  for  the  matrix  T in  terms  of  the  matrix  V.  The  Born 
series  converges,  in  general,  only  if  the  potential  is  sufficiently  small.  Specifically,  we  must  introduce  a suitable  space 
of  operators  A,  B, , acting  on  square-integrable  functions,  and  then  define  a suitable  norm  for  these  operators 
such  that  I)  AGB  K S A S 9 B I , where  G is  Jie  appropriate  Green’s  function,  and  require  that,  in  this  norm. 


IfKl 


Clearly,  the  Jost-Kohn  inversion  procedure  cannot  be  expected  to  yield  anything  usetul  unless  the  Born  series 
converges,  because  the  entire  procedure  is  based  on  this  series.  We  are  therefore  restricted  immediately  to  weak 
potentials  without  bound  states.  The  same  argument  applies  to  the  Jost-Kohn  procedure  for  other  types  of  scatter- 
ing problems,  wliere  it  also  depends  on  the  convergence  of  a perturbation  series. 

The  interesting  thing,  however,  is  that  this  requirement,  seen  here  to  be  necessary,  is  also  essentially  sufficient. 
Suppose  l insist  that  the  norm  of  V is  not  only  less  than  one,  but  also  less  than  one-half: 

I VI  < — 

2 


Tnen  I see  that,  since 


T = V + VGV  + VGVGV  + • * • 


we  must  have 


m<m+m1  + m3+-« 
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or 


1 1 1 

T I <-  + - + — + 
2 4 8 


= 1 


Thus,  we  can  invert  the  equation 


T = V * VGT 


for  V in  terms  of  T : 


V = T(I  + GT ) 

= T - TGT  + TGTGT  - • • • + 

giving  V as  the  sum  of  a formal  series  that  converges  because  I T I < 1 . 

This  is  all  the  J Kohn  procedure  does.  It  is  a very  simple-minded  procedure,  which  is  not  at  all  sensitive  to 
any  of  the  details  of  t ie  problem.  It  is  sensitive  only  to  the  fact  that  the  problem  must  be  solvable  by  convergent 
perturbation  series. 

This  isn’t  the  whole  story,  of  course,  because  in  practice  we  are  looking  for  local  potentials,  and  we  arc  given 
only  part  of  the  con  plete  reactance  matrix.  We  generally  know  the  scattering  data  only  for  certain  aspect  and  scattering 
angles.  For  exampl  - , we  know  the  case  where  k’  is  -k  -that  is,  the  backscattering  data  for  all  aspects.  Or  more 
commonly  in  quantum  mechanics,  we  know  the  scattering  data  for  fixed  aspect  and  all  scattering  angles.  In  either 
case,  we  know  some  but  not  all  of  the  scattering  data.  Counting  dimensions  again,  we  see  that  a local  potential  is  a 
function  of  three  variables,  so  that  in  general  we  will  need  a three-parameter  family  of  information,  and  that’s  just 
what  we  have  in  the  two  cases  mentioned  above. 

Now  the  question  is:  If  we  know  some  part  of  the  scattering  data-some  three-parameter  part-and  we  know 
that  the  potential  is  local,  can  we  then  obtain  the  remaining  scattering  data,  by  analytic  continuation  or  some  other 
method,  to  construct  the  complete  reactance  matrix?  This  is  where  the  papers  of  Faddeev  mentioned  by  Calogero 
(Chap.  S)  are  so  important;  Faddeev  apparently  does  provide  just  exactly  flat  kind  of  information. 

At  this  point,  we  can  see  that  we  have  established  a one-to-one  correspondence  between  a certain  class  of  weak 
potentials,  local  and  nonlocal,  and  a corresponding  class  of  weak  reactance  matrices.  Moreover,  this  correspondence 
is  described  by  convergent  perturbation  series  in  V>th  directions.  Now  let’s  suppose  that  the  original  potent’ si  was 
Ln  fact  local.  The  inversion  procedure  can  be  earned  out  without  this  extra  knowledge  provided  that  the  complete 
reactance  matrix  is  known.  That  the  potential  is  local  is  irrelevant  to  the  inversion  procedure,  but  is  vital  in  the  pre- 
liminary process  of  obtaining  the  complete  reactance  matrix  from  a suitable  partial  description.  It  would  be  highly 
desirable,  I think,  to  find  computationally  effective  wtyj  of  performing  this  preliminary  process. 

Suppose  I look  at  the  first  Born  approximation  only,  taking  die  aeries  for  the  T matrix  and  ignoring  all  but 
the  first-order  term 


? * K + 
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If  1 invert  this  equation  I find  simply  that  in  this  approximation  T a V , If  we  write  this  equation  in  the  momentum 
space  representation,  and  assume  the  potential  is  local,  we  obtain 

7-(k\  k)  a K(k'  - k) 

If  we  know,  for  example,  the  backscatterin0  data  at  all  aspect  angles,  then  we  know  T(k',  k)  for  k'  = -k,  and  so 

b(k)  = T(-k,  k)  3!  K(-2k) 


*0) 


In  other  words,  in  this  approximation  the  momentum  space  description  of  the  potential  is  essentially  given  by  the 
backscattering  data.  This  is  Bojarski’s  result  (discussed  earlier  in  this  chapter),  translated  into  the  quantum- 
mechanical  problem.  His  very  pretty  result  is,  I think,  capable  of  extension  to  all  of  these  : mattering  problems. 

One  can  say  something  qualitative  now  about  how  good  the  above  approximation  is.  It  is  particularly  good 
at  high  energies,  as  is  well  known,  because  the  Bom  approximation  becomes  better  as  you  go  higher  in  energy. 
This  observation  is  the  analog  of  the  one  that  Bojarski’s  result  is  a physical  optics  approximation  valid  fnr  high 
frequencies,  and  the  higher  the  better. 

There  are  many  ramifications  of  the  Jost  Kohn  procedure  that  are  not  yet  clear.  However,  the  subject  can 
probably  be  pursued  with  profit  in  various  directions  to  provide  both  qualitative  information  and  effective  calcu- 
lation procedures  for  all  these  different  scattering  problems. 


DISCUSSION 


Moses:  I was  going  to  suggest  that  this  procedure  be  tried  for  the  one-dimensional  case  from  plus  to  minus  infinity, 

where  tilings  are  much  simpler,  in  fact  some  years  ago  I applied  it  to  the  one-dimensional  problem,  x going 
from  minus  infinity  to  plus  infinity,  and  did  the  first  two  iterations.  I wanted  to  compare  the  first  two  iterations 
with  the  first  two  iterations  of  the  Gelfand-Levitan  one-dimensional  equation.  Much  to  my  surprise  they  were 
the  same.  So  I conjecture,  without  proof,  that  term  by  term  the  Jost-Kohn  procedure  agrees  with  the  Gelfand- 
Levitan  iteration  procedure  if  they  converge.  The  Gelfand-Levitan  iteration  procedure  seemed  to  converge  if 
there  were  no  bound  states. 

Newton:  You  are  talking  about  the  Marchenko  version  of  it? 

Moses:  No,  no.  I am  talking  about  the  one-dimensional,  Kay  version,  x going  from  plus  to  minus  infinity. 

Chadan:  I think  this  app.oach  which  you  explained  has  one  drawback.  As  you  say,  the  first  term  is  good  at  higher 

energies.  That  means  that  it  is  good  for  the  inner  part  of  the  potential  ir.  r space.  Now,  at  high  energies  we 
know  that  the  potential  approach  to  many  of  our  problems  is  not  good.  The  potential  approach  is  good  for 
low  energies,  and  there  you  have  to  know  the  outer  part  of  the  potential. 

Prosser:  I agree  completely. 

Moses:  It  is  just  that  this  is  simpler  to  treat,  I think,  for  electromagnetic  theory  and  perhaps  for  the  model. 

Chadan:  Yes,  because  if  the  frequency  is  large,  then  you  see  lots  of  details  of  the  scatterer. 

Calogero:  I think  the  fact  that  if  you  expand  the  Gelfand-Levitan  equation  or  the  Marchenko  equation  or  this 

approach,  they  will  always  give  the  same  result,  term  by  term.  But  this  is  not  surprising.  This  could  not  be 
otherwise  because  we  know  that  so  long  as  we  stay  within  the  class  cf  nonsingular  potentials  the  scattering 
amplitude  is  holomorphic  at  small  values  of  the  coupling  constant,  and  therefore  you  have  the  power  expansion 
in  the  coupling  constant,  which  is  convergent  in  the  neighborhood  of  g = 0 (let’s  call  g the  coupling  constant), 
and  therefore  the  two  power  functions  cannot  but  coincide;  the  same  is  true  for  the  inverse  problem,  so  t!  is  is 
certainly  so. 

Moses:  Normally  one  puts  the  coupling  constant  in  front  of  the  potential. 

Calogero:  Yes,  but  also  the  inverse  is  true,  that  if  you  put  the  coupling  constant  in  front  of  the  reflection  coefficient 

and  you  expand  the  potential  as  a power  series  in  this  coupling  constant,  this  is  also  convergent  provided  the  re- 
flection coefficient  satisfies  certain  conditions. 

Moses:  That’s  right. 

Calogero:  But  these  are  just  the  conditions  that  correspond  to  the  requirement  that  the  potential  be  nonsingular, 

and  this  has  to  do  v ith  the  asymptotic  behavior  in  k. 

Moses:  But  this  is  an  inverse  problem.  You  want  the  condition  on  the  reflection  coefficient. 

Calogero:  These  conditions  are  ruy  to  find:  They  are  just  the  conditions  on  the  asymptotic  behavior  of  large  k . 

It  has  to  decay  at  large  k sufficiently  rapidly  so  that  when  you  iterate  you  don’t  get  divergences. 

Moses:  That’s  all  1 wanted  to  say.  Incidentally  I have  to  say  that  when  bound  states  were  present,  the  iteration 

serves,  for  the  Geliand-Levitan  equation  did  diverge.  You  have  to  remove  the  residues  corresponding  to  bound 
states. 
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Sabatier:  There  is  a recent  series  of  papers  by  Barringer,  and  other  nuclear  physicists,  on  the  off-shell  continua- 

tion of  t , and  its  use  in  inverse  problems. 
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THE  LOCATION  OF  THE  POLES  OF  THE  SCATTERING  MATRIX 

Peter  D,  Lax 
New  York  University 

and 

Ralph  S.  Phillips  t 

Stanford  University  **  ** 


This  report  describes  a number  of  recent  results  bearing  on  the  direct  problem;  that  is,  they  relate  geometric 
properties  of  the  scattering  obstacles  to  analytic  properties  of  the  associated  scattering  matrix. 

Our  results  deal  with  exterior  problems;  for ::  uplicity,  let’s  take  the  acoustic  case,  governed  by  the  wave  equat.on 

t itt  - Au  = 0 (1) 

in  the  exterior  of  an  obstacle  O,  on  which  u satifie*  the  boundary  condition 


d 


d r. 


u 


= 0 


(2) 


Other  boundary  conditions  such  as  u - 0 and  du/dn  + Ku  - 0 will  also  be  considered. 

We  have  studied  near-field  phenomena-  the  asymptotic  behavior  near  the  obstacle  of  solutions  whose  initial 
values  are  zero  out3ide  of  a bounded  set.  What  happens  asymptotically  to  such  a signal  depends  on  the  geometry  of 
the  obstacle.  The  key  property  seems  to  be  whether  there  are  reflected  rays  that  stay  inside  the  hull  of  the  obstacle 
for  arbitrary  lengths  of  time.  If  there  are  such  rays,  the  obstacle  is  nailed  capturing.  If  on  the  other  hand  all  rays  are 
reflected  out  to  infinity  after  a maximum  sojourn  time,  the  obstacle  is  called  noncapturing.  Convex  obstacles  are 
noncapturing  par  excellence,  and  so  are  star-shaped  ones. 

Our  conjecture  for  noncapturing  obstacles  is  that  there  exists  a sequence  of  complex  numbers  jiy,  / ■ 1,2,..., 

0 > Rtpi  > Re/ij  >•••-*•  -«e  (3) 

and  functions  Ky(x),  j - 1, . . . , depending  only  on  the  obstacle,  such  that  every  solution  of  the  wave  equation  in 
the  exterior  whose  initial  values  have  compact  support  has  an  asymptotic  expansion  of  the  form 


u(x,  t)  =*  ^ aje  y Wj(x)  (4) 

the  numbers  ay  depend  cm  the  solution  in  question.  This  conjecture  has  been  proved  for  convex  bodies  by  Ludwig 
andMorawetz  [ 1969)  and  Philtipt  [1969].  Few  star-shaped  bodies,  Lax  et  al  [1963]  have  shown  a weaker  result; 

u(x.  t ) - 0 (*-*')  (5) 

where  k is  some  positive  constant  that  depends  only  on  the  obstacle. 

8-50 


The  intuitive  reason  for  the  exponential  decay  indicated  in  Eqs.  (4)  and  (5)  is  this:  outside  a noncapturing 
obstacle  all  rays  go  to  infinity ; according  to  geometrical  acoustics  signals  follow  rays  except  for  a diffracted  portion. 
This  remaining  portion  again  propagates  out  to  infinity  along  rays,  except  for  a diffracted  portion,  and  so  on  ad 
infinitum.  The  signal  remaining  around  the  obstacle  decays  exponentially  in  t . 

High-frequency  signals  suffer  little  diffraction;  that  is,  they  follow  rays  pretty  closely  even  after  a multiplicity 
of  reflections.  Therefore,  around  a capturing  obstacle,  which  supports  arbitrarily  long  reflected  rays,  there  will  be 
signals  whose  strength  around  the  obstacle  is  hardly  diminished  no  matter  how  large  t is. 

There  is  a bit  of  mystery  about  the  numbers  Pj  and  functions  Wj  entering  the  expansion  in  Eq.  (4);  purely 
formally,  e^j!  w,(x)  must  be  a solution  of  the  wave  equation  so  that 


V/Vfj  - AWy 


(6) 


and  Wj  must  satisfy  the  boundary  condition 


d 

— Wj  - 0 on  the  boundary  of  0 


(7) 


Clearly  we  are  not  dealing  with  ordinary,  or  even  generalized,  eigenfunctions  that  figure  in  the  spectral  resolution  of 
A exterior  to  0,  since  the  eigenvalue  pj  is  not  real.  To  distinguish  them  from  the  eigenfunctions  occurring  in  the 
spectral  resolution  of  A,  we  shall  call  the  w-  and  pj  occurring  in  Eq.  (4)  the  scattering  eigenfunctions  and  fre- 
quencies of  A in  the  exterior  of  0,  subject  to  the  boundary  condition  in  Eq.  (7).  Near  infinity,  the  scattering 
eigenfunctions  satisfy  a radiation  condition  of  the  following  sort: 

For  I x I large,  w(x)  can  be  expanded  in  a convergent  series  of  the  form 


w(r)  = « 


o 


(8) 


where  cw(x)  are  homogeneous  functions  of  degree  0. 

In  theorem  5.5  of  Lax  and  Phillips  [1969] , the  following  is  proved: 

Theorem  1 For  any  obstacle  0 there  are  infinitely  many  real  scattering  frequencies  p •;  the  number 
N(o)  of  pj  between  0 and  -a  satisfies 


c < liminf 


(9) 


If  O is  star  shaped,  there  is  a corresponding  upper  bound  for  N : 

Nfo) 

lim  sup  — " — < C 
<r 


(10) 
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The  quantities  c and  C are  related  to  the  geometry  of  the  body  by  c n 1.12r  and  C*  1 .12/? 5 where  r and  R 
are  radii  of  inscribed  and  circumscribed  spheres.  In  fact  we  conjecture  that  for  any  obstacle, 


Hi 2l 

o2 


tends  to  a limit  as 

A more  recent  result,  contained  in  Lax  and  Phillips  [1972] , is  the  following: 

Theorem  2 There  are  no  scattering  frequencies,  real  or  complex,  inside  the  circle 


lit  + 


1 

2 R 


1 

I < 

2 R 


(11) 


where  R is  the  radius  of  a sphere  containing  0. 

The  following  is  a converse  of  theorem  1 : If  for  large  o, 


then  the  obstacle  in  question  does  not  contain  a sphere  of  radius  0.94  (A  'l/1 ),  and  is  not  contained  inside  a sphere 
of  radius  0.94  U_,/2). 

The  following  is  a converse  of  theorem  2: 

If  p is  a scattering  frequency  contained  inside  the  circle  in  Eq.  (1 1),  then  the  obstacle  0 is  not  contained 
inside  a sphere  of  radius  R . 

If  we  could  observe  experimentally  the  location  of  the  scattering  frequencies,  the  last  two  results  would  shed 
some  crude  light  on  the  overall  size  of  the  obstacle  0.  The  chances  of  making  such  measurements  are  not  very  good, 
unfortunately.  Since  the  natural  characterization  of  these  frequencies  is  in  terms  of  the  relaxation  of  the  reflected 
near  field,  we  need  near  field  meaxureii*..its,  and  these  are  not  etsy  to  come  by.  It  is  unlikely  that  we  can  measure 
accurately  any  of  the  scattering  frequencies  but  the  one  that  decays  most  slowly-that  is,  the  one  whose  real  part  is 
largest.  Thus,  there  is  no  chance  of  making  cny  practical  use  of  the  converse  of  theorem  1 , and  only  a slim  chance 
that  the  converv.  of  theorem  ? can  be  used  to  give  a lower  bound  on  the  size  of  O. 
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DISCUSSION 


Moses:  There  is  another  inverse  problem  which  we  haven’t  mentioned,  and  tl:it  is  the  synthesis  prob'em  That  is, 

you  ."dually  want  to  construct  something  that  has  a predetermined  shape  or  something.  What  if  you  want  to 
construct  a hard  body?  You  have  to  start  off  with  something,  h scattering  operator  has  to  have  some  require- 
ments that  indeed  produce  a hard  body.  These  are  necessary  conditions  for  this. 

In  fact,  in  real  electromagnetic  engineering  work  this  synthesis  problem  is  a very  important  one.  You 
actually  do  want  bodies  that  have  specific  reflection.  You  do  want  transmission  lines  of  that  type.  It  is  not  a 
diagnostic.  It  is  a construction. 

Newton:  1 just  wanted  to  make  a brief  remark  about  this  work.  This  result,  I believe,  is  quite  different  from  what 

one  could  possibly  hope  to  expect  in  quantum  mechanics,  because  in  the  first  place  you  don’t  have  any  signals 
with  compact  support  in  quantum  mechanics,  and  partly  because  of  that  of  course,  the  cause  is  the  same 
the  re  will  be  always  asymptotic  terms  that  go  down  as  the  inverse  power  of  t . You  don’t  have  this  kind  of 
result. 


Now,  your  second  result  there  about  the  distribution  of  these,  the  leading  term  n there,  seems  to  me  to 
be  clcsely  related  to  a result  in  quantum  mechanics  which,  in  the  case  in  which  the  scatterer  has  finite  support , 
relates  the  asymptotic  decay,  the  relaxation  length,  which  is  exactly  the  analog  oi  ;» , to  the  size  of  the  body. 

So  I think  that  is  quite  directly  related  to  it. 

There  is  a reference  to  the  quantum  mechanical  result  in  iny  hook.  In  fact,  it  is  discussed  there. 

Kay:  How  important  is  the  requirement  that  the  ir  tial  value  have  compact  support?  Suppose  you  have  a point 

source  and  move  it  out  to  infinity  so  that  it  becomes  a plane  wave? 

If  you  are  treating  this  as  an  initial  value  problem,  you  start  with  a signal  which  is  emo  everywhere  except 
in  a bounded  region;  this  would  presumably  satisfy  the  requirement.  Then  can  you  move  that  region  out  to 
infinity,  increasing  the  strength  as  you  go.  sc  that  you  end  up  with  an  incident  plane  wave? 

Lax:  Well,  the  farther  out  you  move  it  the  longer  yc u have  to  w»it  for  it  to  come  in. 

What  is  important  here  is  that  we  don’t  want  things  coming  in  after  the  initial  signal  arrives.  We  don't 
want  rays  conti  .cully  coming  in  from  infinity  and  interfering  with  this  somewhat  delicate  decay  behavior. 

That  is  why  I insisted  on  a pulse 

Moses:  I don't  think  it  has  to  be  1 mean,  any  pulse  would  be  the  scattering  operator. 

Kay:  No,  but  the  question  here  as  related  to  the  quantum  mcwianical  application  would  be  important,  because 

there  we  would  like  to  consider  essentially  a plane  wave. 

Moses:  But  you  could  use  the  time-dependent  analog  if  you  chose.  The  scattering  problem  «ould  be  the  same. 

Prosser:  This  depends  effectively  on  the  noncapturing  conditions,  because  if  you  have  a body  that  captures,  then 

you  can  have  aroitrarily  slow  decay  rates. 

Lax:  That  is  right.  If  there  is  a phenomenon  of  capturing  possible,  as  in  this  picture,  then  the  following  happens 

Lock  at  tLs  interior  cavity.  If  it  is  bitten  olf  sufficiently  sharply  there  is  a reasonaole  way  of  closing  it  off  as 
an  interior  problem,  as  an  interior  region,  and  this  interior  region  wi!  ,ave  eigenvalues  Xl(. , which  correspond 
to  vibration  frequencies  J ). 


H-5J 


Then  the  exterior  problem  will  have  p ^ , which  will  be  approximately  like  these  minus  a small  real  part 
p fc,  where  p^  tends  to  0 as  A tends  to  infinity.  For  the  higher  and  higher  modes  the  exteiior  modes  will  be 
very  close  to  the  interior  modes.  These,  however,  are  highly  oscillatory  modes,  and  the  statement  referred  to 
ihe  nonoscillating  modes.  Hie  ncioscillating  decay  modes  are  related  to  the  bulk  properties  of  the  object,  and 
;*>e  highly  oscillatory  ones  are  to  be  detailed. 

Vmmvs:  l think  these  results  are  very  interesting,  and  the  subject  raises  some  interesting  possibilities  for  inversion 

1 don’t  know  how,  but  it  seems  that  it  should  be  possible.  Certainly  the  sort  of  thing  that  some  people  in 
electromagnetics  have  been  pUving  with-for  example,  it  is  possible  to  reconstruct  the  size  of  the  sphere  from 
the  scattered  "icid?  Now  » oody  knows  it  is.  It  has  been  done  in  practice.  This  is  another  ompleteiy 
different  appu  jeh  to  the  s.-r.  . kind  of  question.  Once  you  have  done  it  for  a sphere  you  can  go  to  more  com- 
plicated things. : d perhaps  from  the  knowledge  of  the  radiative  states  you  can  deduce  the  properties  of  the 
system  . 

Gi'ogero:  Just  or.e  remark  in  connection  with  whi  you  have  just  said.  There  is  a tendency  to  think  that  once  one 

has  proved  existence  and  uniqueness  and  perhaps  a constructive  procedure,  the  inverse  problem  is  solved.  That 
is  net  quite  so,  because  you  may  have  a very  complicated  construction  procedure  going  through  an  integral 
equation,  for  instance. 

Now,  one  can  try  to  formulate  other  less  ambitiocs  inverse  problems,  in  which  you  do  not  want  to  recon- 
struct the  whole  thing  but  you  want  to  find  some  features  of  it,  and  then,  of  course,  you  may  want  to  find 
interesting  features,  general  shape  or  things  like  this.  Then  you  may  hops,  if  you  are  lucky,  that  you  may  be 
ibie  to  set  up  an  inverse  problem  that  will  produce  only  these  features,  and  this  may  be  much  simpler.  The  case 
l was  talking  about,  the  result  obtained  by  Newton  and  Faddeev,  is  an  example  of  this  type,  in  which,  in  the 
context  of  the  fixed  aaglular  momentum  problem  in  quantum  mechanics,  the  general  problem  is  solved  through 
the  Gelfand-Levitan  and  Marchenko  equations,  but  that  involves  solving  an  integral  equation.  It  turns  out  that 
the  explicit  formulas  can  be  obtained  for  the  value  of  the  potential,  in  fact  all  its  derivatives,  at  the  origin, 
which  is,  of  course,  much  better,  if  one  is  interested  in  that  feature  of  the  potential,  because  that  is  an  explicit 
formula. 

Shmoys:  This  would  give  us  a different  approach  than  comes  in  from  the  other  side.  It  appears  to  yield  some 

intormatiuo  about  the  extent  or  moment. 

Calogrro:  Yes. 

Parker  I would  just  like  to  amplify  your  remark.  This  turns  out  to  be  exactly  what  happens  in  a specific  geo- 
physical inverse  problem  where,  in  the  past,  people  have  tr>d  to  be  far  too  ambitious.  The  data,  in  fact,  you 
can  prove  quite  accurately,  are  completely  ambiguous  on  the  abject  of  the  material  [being  sought  from  the 
measurement]  It  has  to  do  with  the  magnetization  of  sea  mounts.  They  have  been  trying  to  construct  the 
internal  details  of  the  magnetization. 

But  it  turns  out  the  ultimate  purpose  of  their  work  was  a bulk  property , which  could  be  found  from  the 
measurements,  and  so  they  have  gone  to  all  this  great  length  of  trying  to  parair-  terize  the  model,  and  limit  the 
model  in  some  way  to  make  the  problem  unique,  solve  it,  ard  then  avetage  the  answer  to  find  a bulk  property. 
Now,  in  fact,  if  you  can  formulaic  the  problem  in  such  a way  to  go  directly  to  the  bulk  property,  everything  is 
nice.  I think  this  is  a veiy  important  point  you  made 

Kay:  I d like  to  mention  another  point  lie*  occurred  to  me  about  the  particular  result,  and  that  is  that  if  you  con- 

sider the  long  wavelength  limit  to  scattering,  that  is,  the  body  is  small  compared  to  the  wavelength  when  you 
get  Rayleigh  scattering,  ilte  cross  section  is  proportional  to  the  square  of  the  volume  of  the  body,  and  there  may 
be  soms  relationship  between  that  asymptotic  result  and  the  transient  result  that  might  be  worth  considering. 
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MISCELLANEOUS  COMMENTS 


R.  Parker:  I would  like  to  raise  a question  about  an  inversion  problem  th-t  has  not  been  considered  here  in  detail. 

It  is  addressed  to  those  people  who  are  considering  the  electromag'.ieiic  scattering  problem;  it  is  a three-dimen- 
sional scattering  problem  that  is  routinely  considered,  and  yet  1 don't  know  if  any  real  mathematics  has  been 
done  on  it.  It  is  the  very  important  problem  of  constructing  crystal-lattice  electron  densities  and  molecular 
structures  from  x-ray  scattering. 

The  treatment  is  just  a special  case  of  very  high-frequency  electromagnetic  scattering  from  a truly  three- 
dimensional  body  with  strong  periodicity  condit’jns  imposed  on  the  body.  There  is  a definite  element  of 
nontriviality  about  it.  The  problem  is  formal1/  very  easily  soluble  if  you  have  complex  scattering  information, 
but  in  fact  one  only  has  intensity  measurements.  The  phases  are  lost  to  you  by  photographic  film. 

1 was  wondering  if  anyone  had  looked  at  the  formal  mathematics  of  this  situation,  to  talk  about  the 
nonuniqueness,  because  people  routinely  claim  they  know  exactly  what  the  atoms,  and  so  on,  are. 

J.  Shmovs:  We  have  a fairly  actve  group  in  crystallography  at  Brooklyn  Poly,  and  despite  many  attempts  we  have 

' et  to  establish  contact  between  that  group  and  our  electromagnetics  group,  which  is  'ery  much  interested  in 
periodic  structures.  There  are  semantic  difficulties  that  seem  to  crop  up. 

I.  Kay:  The  method  of  making  holograms,  which  is  to  mix  a known  coherent  field  with  the  scattered  field,  will 

allow  record1. ig  on  film  of  phase  information.  It  seems  to  me  that  something  along  those  lines  would  be  a 
more  praracal  way  of  inverting  this  type  of  problem  than  considerations  of  intensity  alone,  which  don't  really 
have  enough  information  to  do  the  inversion.  You  need  phase  information. 

R.  Parker:  But  is  that  completely  practical? 

1 Kay:  This  isn’t  so  farfetched.  There  are  lasers  in  the  high  UV  region  right  now.  The  reason  it  is  impossible  to 

invert  pure  intensity  information  is  that  it  is  incomplete.  You  can’t  reconstruct  the  phase  information  from 
the  intensity.  This  is  a well-known  fact.  You  can  do  it  the  other  way  around. 

J.  Shmoys:  We  have  heard  the  problem  of  identifying  the  phase  function  from  the  amplitude  function  in  Professor 

Newton’s  talk. 

/.  Kay:  It  is  the  other  way  around.  Ycu  can  derive  the  amplitude  from  the  phase,  but  not  vice  versa. 

J.  Shmoys:  Holography  is  an  interesting  new  area  for  exploration  as  an  inverse  method.  1 for  one  don't  understand 

completely  its  possibilities  and  limitations. 

K.  Chadan:  I have  a comment  on  the  existence  of  the  kernel  of  the  Gelfand-Levitan  equation  in  quantum  scattering 

theory.  I will  just  discuss  the  case  of  the  S-wave.  We  have  the  radial  Schrodinger  equation  which  reads 

0"+£0'  = K0  (1) 

and  we  are  looking  for  the  regular  solution,  which  satisfies 

0(  O'1  = 0,  0’(O)=1.  (2) 

Now,  on  the  basis  of  the  Volterra  integral  equation  for  4>  which  incorporates  the  boundary  conditions  in 
Eqs.  (2),  or  ov  using  the  well-known  Poincare  theorem,  you  can  prove  that  0 is  an  even  entire  function 
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of  k for  any  value  of  r.  You  also  find  from  the  integral  equation,  that  the  asymptotic  behavior  of 
<(>  is  given  by 


where  the  first  term  is  just  the  free  solution  0()  We  see  therefore  that  0 is  an  entire  function  of  k of  order 
land  type  r for  each  value  of  r.  Moreover  is  square  integrable  on  ihe  real  k axis.  It  follows  then 

from  the  Paley-Wiener  theorem  on  the  support  of  the  Tourier  transform  of  such  functions  that  for  each  r. 


0 00  ~ 


K(r,  t)eikt 


dr 


(4* 


wh  re  K(r,t)  is  square -integrable  with  respect  to  t on  [-r,  rj . By  integrating  by  parts,  and  using  the  fact 
that  0 is  even  in  k , one  can  put  Eq.  (4)  in  the  familiar  form  given  in  Professor  Newton's  talk.  So  you  see  that 
here  in  the  scattering  problem  the  existence  of  the  kernel  K (r,  t)  is  just  a consequence  of  Eqs.  ( 1 ) and  (2)  and 
nothing  else.  This  is  in  connection  with  Professor  Moses’  remark  that  the  existence  of  the  kernel  may  be  related 
to  the  causality  condition.  This  may  be  true  in  the  time-dependent  case  with  electromagnetic  v/aves  where  you 
can  make  wave  packets  of  finite  size,  and  where  you  know  that  the  speed  of  the  particles  is  finite.  In  the  non- 
relativi  tic  case,  particles  may  have  any  speed,  and  there  are  no  wave  packets  with  sharp  fronts  since  only  posi- 
tive values  of  the  wavenumber  are  available,  so  that  there  does  not  exist  any  simple  way  of  formulating  causality. 

H.  Moses:  There  is  a time-dependent  problem  with  causality,  which  lrv  Kay  brought  up  when  he  gave  his  talk,  which 

has  exactly  the  same  scattering  properties  as  this  problem,  and  this  is  the  one  that  I wa-  referring  to.  It  is  a 
synthetic  problem,  but  it  has  exactly  the  same  scattering  properties. 

R.  Phinney:  1 hate  some  points  and  some  questions  which  are  not  my  own  by  any  means,  but  represent  things  that 

1 have  culled  fiom  conversations  in  the  past  several  days.  To  the  seismologist  the  use  of  Geifand-Levitan  methods 
has  a certain  great  attractiveness.  Many  of  us  are  just  learning  about  these  methods,  and,  of  course,  we  nave 
been  learning  with  realistic  data,  in  fact  acting  upon  this  for  some  time. 

The  first  question  is  addressed  to  the  people  in  quantum  mechanics.  Are  there  experimentalists  or  teams 
with  quantum-mechanical  data  upon  which  Geifand-Levitan  methods  are  being  applied,  or  upon  which  they 
ought  to  be  applied  ir  they  knew  better? 

R.  Newton:  As  to  the  first  part  of  the  question,  1 think  the  answer  is  definitely  no;  they  aren’t  being  applied.  Now, 

whether  they  ought  to  if  they  knew  better  is  perhaps  not  answerable  quite  that  easily.  The  tiouble  is  that  in  most 
practical  problems  in  which  one  really  wants  to  know  the  force  and  where  the  force  is  not  very  well-known- 
say  in  a nuclear  physics  problem-the  first  kind  of  inversion  problem  is  not  really  very  practical  because  you 
don’t  know  the  data  at  the  highest  energies.  The  second  kind  of  inversion  problem,  at  fixed  energy,  would  be 
much  more  practical  if  it  were  not  for  the  fact  that  particles  have  spin,  and  that  complicates  matters  considerably. 

In  fact,  the  inversion  problem  at  fixed  energy  for  particles  with  spin,  if  the  force  depends  on  the  spin,  has  not 
been  completely  solved  I had  a graduate  student  who  tried  that  as  a project  for  his  thesis.  He  didn't  push  it 
all  the  way  through  as  an  inversion  problem,  only  part  way,  and  it  looks  terribly  complicated. 

There  are  in  addition  spin-orbit  forces,  and  Sabatier  has  written  a paper  on  the  inversion  problem  with 
fixed  energy  with  the  spin-orbit  forces,  and  it  is  also  very  complicated.  So  to  what  extent,  even  after  that 
problem  is  completely  solved,  it  will  be  practical  is  quite  open.  ' 

I 
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J.  Shmoys:  Let  me  just  add  a comment.  In  answer  .o  this  sort  of  question  from  a nuclear  physicist  who  is  an 

experimentalist,  he  felt  it  would  be  nice  to  try  and  apply  these  methods,  even  assuming  the  simple  Schrodins'c 
equation.  I think  it  just  has  not  occurred  to  many  experimenta  Us  that  they  ought  to  try  to  do  something 
like  that  with  their  data.  I am  not  sure  what  the  reasons  are,  whether  it  is  inertia  or  what,  but  I think  it  should 
be  done,  probably  will  be  done  eventually,  but  there  is  somehow  no  impetus  at  present. 

R.  Newton:  Perhaps  I should  add  something  else.  These  techniques  are  sometimes  useful  not  just  for  numerical 

inversions,  but  they  are  also  useful  for  the  construction  of  explicitly  solvable  examples.  For  example,  the 
whole  class  of  Bargmann  potentials  has  been  found  by  means  of  Gelfand -Levitan  technique.  Now,  of  course, 
once  you  have  found  those  examples  you  don’t  need  the  Gelfand-Levitan  technique  any  longer.  They  stand 
on  their  own  feet.  But  that  was  the  original  way  of  finding  them.  So  there  you  have  a large  class  of  potentials 
foi  which  the  Schrodinger  equation  is  explicitly  solvable.  That  is  the  class  of  potentials  which  lead  to  scatter- 
ing matrices  that  are  rational  functions  of  k . Whenever  the  scattering  matrix  is  a rational  function  of  k , 
then  you  can  write  down  the  potential  that  leads  to  it  explicitly  for  one  angular  momentum,  that  is  for  one 
Lvalue. 

J.  Shmoys:  The  equivalent  sort  of  thing  for  transmission  lines  synthesis  was  Sharpe’s  theory. 

H.  Moses:  I might  mention  that  there  is  some  interest  in  applying  this  to  the  ionosphere,  provided  it  is  simple 

enough,  i.e.,  it  is  isotropic.  There  is  an  error  analysis  problem  to  which  1 think  the  geophysicists  have  con- 
tributed a great  deal.  You  know,  usually  you  don’t  have  all  the  frequencies  available.  Also,  in  the  past  you 
haven’t  had  amplitudes  available,  but  now  they  are  getting  amplitudes  and  phases  rather  well  for  this  scatter- 
ing. It  is  just  too  recent.  I think  it  is  only  in  the  last  couple  of  years  that  material  has  come  out  on  the  iono- 
sphere. There  is  an  error  analysis  problem  because  you  don’t  have  all  frequencies  available. 

R.  Phinney:  A slight  extension  of  my  question  concerns  the  existence  of  limits  to  the  amount  of  data,  not  neces- 

sarily the  finiteness,  but  the  cut-off  in  terms  of  frequency  bands  available.  Certainly  we  would  be  very  inter- 
ested in  methods  which  are  both  mathematically  correct,  in  some  sense,  and  practical,  for  applying  to  low  and 
high  frequency  tails;  to  observed  data  which  one  might  assume  to  be  a continuum  of  perfectly  good  data  in  the 
middle  of  an  accessible  region.  I take  it  from  the  remarks  that  this  question  has  not  yet  become  really  important. 

R Newton:  Oh,  it  is  important,  I think  that  is  a very  important  question.  In  fact,  that  is  a very  interesting  problem 

that  hasn’t  been  solved.  I think  that  will  be  an  extreme!.'  interesting  problem  to  solve  for  many  applications, 
namely,  the  problem  of  what  class  of  potentials  -if  I describe  it  in  quantum  mechanical  language-would  fit 
certain  phase  shift  curves  known  only  in  certain  energy  or  frequency  regions.  That  is  a very  interesting  question, 
but  that  hasn’t  been  solved. 

H.  Moses:  It  is  related  to  the  question  of  determining  the  character  of  the  scattering  operator  corresponding  to 

some  knowledge  of  the  potential.  Suppose  you  know  the  potential  cutoff  in  the  nuclear  physics  case.  Then 
you  do  know  the  arymptotic  forms  of  the  scattering  operator,  and  peihaps  something  could  be  done,  some 
sort  of  curve-fitting  procedure . It  is  in  its  infancy. 

J.  Shmoys:  I think  that  the  situation  in  seismology  may  be  worse,  in  a way,  than  in  quantum  mechanics  or  in 

plasma  diagnostics,  in  that  if  you  let  the  energies  become  large  compared  to  the  largest  value  of  V,  then  you 
can  say  something  readily  about  the  asymptotic  behavior,  whereas  if  you  have  a nondispersive  medium  that 
tail  is  completely  unpredictable.  The  dispersive  character  helps  in  quantum  mechanics  and  in  plasma  diagnos- 
tics, whereas  it  would  not  in  the  nondispersive  case. 

R.  Phinney:  Since  you  raised  the  practicalities  of  the  high  frequency  problem,  in  seismology  there  are  two  elements 

to  it.  The  first  is  perhaps  the  simplest  one,  and  that  is  the  existence  of  finite  dissipation  which  causes  the 
broadening  of  spectral  lines,  and  in  fact,  the  inability  tc  resolve  spectral  lines  as  free  oscillations  at  short  periods. 
This  leads  in  practice  to  a complete  traveling  wave  picture  for  the  kinematics.  This  essentially  removes  high 
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frequencies.  In  fact,  the  high  frequency  signals  that  one  observes  have  really  fewer  degrees  of  freedom  in  the 
data.  Most  of  the  signals  have  been  completely  attenuated  and  are  not  visible  at  all.  This  fortunately  removes 
some  of  the  infinity  of  degrees  of  freedom  from  the  high  frequency  problem. 

The  other  one  is  a little  trickier,  and  this  is  related  to  the  existence  in  seismology  of  multiple  scales  of 
the  model,  in  which  the  scale  length  of  the  model  can  range  from  centimeters  to  thousands  of  kilometers.  We 
have  been  using  ray  theory  for  some  time,  and  are  quite  aware  of  improvements  to  ray  theory,  which  enables 
one  to  look  asymptotically  at  amplitudes,  and  yet,  for  instance,  one  can  look  at  a sharp  interface  as  a model 
for  a particular  physical  situation,  and  study  the  reflection  coefficients  from  this  interface.  Of  course,  they 
are  frequency  independent.  One  can  then  model  this  interface  by  some  kind  of  ramp  or  Epstein  potential  which 
is  smooth  and  get  asymptotic  behavior,  and  it  is  always  analytic. 

But,  in  fact,  observed  reflection  coefficients  don’t  behave  this  way  at  all.  We  get  some  practical  fellows 
from  the  oil  business  who  will  say  “Well,  if  we  tune  the  band  pass  filters  like  this,  we  get  really  good  signals. 
Otherwise  you  can  forget  the  experiment.”  This  means  that  the  existence  of  the  multiple  scales  is  getting  to 
us,  and  in  a really  practical  sense  the  earth  is  not  even  a continuum,  and  we  use  continuum  mechanics,  of 
course,  to  derive  these  equations. 

This  leads  to  a remark  about  Tom  Jordan’s  autocorrelation  functions  for  the  model.  These  are  essentially 
smoothing  windows  which  one  decides  the  model  ought  to  be  seen  through.  They  have  been  introduced  in 
seismology,  and  yet  there  they  are  subject  most  to  a priori  assumptions  for  which  you  have  no  physical  laws 
except,  conceivably,  for  the  earth’s  core,  which  is  thought  to  be  a liquid. 

Yet  in  some  of  the  other  problems  1 have  seen  here,  in  plasmas  and  atmospheric  problems,  it  is  cleat 
that  the  physics  of  the  medium  on  small  scales  puts  restrictions  on  how  sharply  the  model  can  change,  and  so 
you  can  actually  specify  autocorrelations  in  the  model,  or  at  least  inequalities  to  these  things,  and  I think  you 
can  perhaps  do  something  very  useful  this  way.  This,  in  a sense,  provides  a rational  way  of  smoothing  the  data. 

R.  Newton:  I would  just  like  to  say  a few  words  about  something  that  may  be  of  use  to  some  people.  There  was 

some  discussion  about  the  relation  between  solving  the  exact  inverse  problem  and  a discretisation  that  one 
uses  in  approximations.  I just  want  to  give  one  little  example  of  something  which  1 think  shows  that  the 
solution  of  an  exact  inverse  problem  can  be  very  instructive,  where  one  can  be  led  astray  by  discretization. 

Let  me  go  back  to  the  phase  problem,  in  which  a scattering  cross  section  is  given  as  a function  of  the 
angle  and  one  wants  to  find  the  phase  of  the  scattering  amplitude.  Suppose  that  an  experimentalist  is  given  a 
scattering  cross  section  at  a fixed  energy  as  a function  of  the  angle,  and  he  finds  that  the  curve  that  he  gets  is 
reasonably  simple.  Suppose  he  now  fits  that  angular  distribution  by  a Legendre  expansion,  in  which  he  will  of 
course  use  only  a finite  number  of  terms,  perferably  a small  number  of  terms.  He  finds  that  to  within  his  exper- 
imental error  he  can  fit  the  cross  section  by  a linear  combination  of  Legendre  polynomials  up  to  order  2 L.  In 
other  words,  hr  has  now  2 L + 1 real  parameters  which  he  can  use  to  fit  his  curve.  Any  experimentalist  doing 
this  would  immediately  draw  the  conclusion  that  if  the  cross  section  can  be  fit  by  a Legendre  expansion  up  to 
order  2 L then  the  amplitude  must  require  only  Legendre  polynomials  up  to  order  L. 

Now,  the  generalized  optical  theorem  implies  that  each  term  in  the  Legendre  expansion  of  the  amplitude 
has  only  one  real  parameter,  namely  the  phase  shift.  Here  he  has  2 L + 1 real  parameters,  and  there  he  has  only 
L + 1 real  parameters  to  play  with.  Therefore  he  draws  the  conclusion  that  the  generalized  optical  theorem 
implies  a strong  restriction  on  the  possible  angular  dependence  of  a cross  section. 

But  from  the  exact  solution  of  the  inverse  phase  problem,  we  know  the  following.  The  fixed-point 
theorem  tells  us  that  if  you  give  me  any  reasonably  smooth  angular  distribution  for  a cross  section,  if  1 now 
multiply  that  cross  section  by  a parameter  X,  and  if  I make  X small  enough,  then  there  always  exists  an 
amplitude  that  fits  that  cross  section,  no  matter  what  the  angular  distribution  is.  Therefore,  the  generalized 
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optical  theorem  imposes  absolutely  no  restrictions  on  the  possible  angulai  distributions  that  you  can  get.  Thus 
the  conclusion  that  was  drawn  here  on  the  basis  of  the  discretization  was  completely  wrong. 

Of  course,  if  you  assume  that  the  amplitude  has  only  a finite  numb  :r  of  Legendre  polynomials  in  it,  then 
obviously  it  must  terminate  at  L.  If  the  cross  section  (the  absolute  valu'  squared  of  the  amplitude)  teiminates 
at  2 L,  then  the  amplitude  must  terminate  at  L.  But  the  point  <s  that ' ou  may  have  to  have  infinitely  many 
terms,  and  almost  all  amplitudes  that  fit  a cross  section  that  requires  only  Legendre  polynomials,  up  to  order 
2 L,  would  require  an  expansion  with  infinitely  many  coefficients. 

1 think  this  shows  that  an  exact  solution  of  an  inverse  problerr  can  teach  you  something  about  things 
that  one  otherwise  might  not  be  aware  of. 


9.  WORKSHOP  SUMMARY 


P.  C.  Sabatier  volu  nteered  to  author  an  overall  summary  to  conclude  both  the  workshop  and  these  proceedings. 
The  Editor  felt  that  the  scope  of  this  task  and  the  magnificent  way  in  which  it  was  accomplished  warranted  its  publi- 
cation in  a separate  chapter  of  this  volume.  J.  Shmoys,  session  chairman,  also  has  provided  additional  summary 
comments. 
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The  only  way  to  extract  maximur.  information  from  experimental  results  is  to  solve  inverse  problems.  When  a 
new  study  is  undertaken,  the  first  goal  of  the  physicist  is  to  obtain  satisfactory  fits.  This  can  be  done  either  by 
approximate  methods  or  numerical  computations.  However,  it  soon  becomes  obvious  that  “good  fits”  can  be  obtained 
that  correspond  to  fairly  different  data  on  the  unknown  quantities.  The  progressive  realization  of  the  consequences 
of  this  nonuniqueness  imply  an  evolution  of  both  the  methods  and  the  centers  of  ’nterest  in  inverse  problems.  This 
evolution  is  schematically  described  in  the  present  paper,  together  with  the  various  mathematical  methods  used.  A 
comparative  description  is  given  of  inverse  methods  in  scientific  research,  witn  '.xamples  taken  from  mathematics, 
quantum  ;;nd  classical  physics,  seismology,  transport  theory,  radiative  transfer,  electromagnetic  scattering,  electro- 
cardiology, etc.  It  is  hoped  that  this  paper  will  pave  the  way  foi  an  interdisciplinary  study  of  inverse  problems,  lead- 
ing eventually  to  the  design  of  experiments  that  are  most  efficient  to  give  information. 


INTRODUCTION 


Our  knowledge  of  the  world  proceeds  through  “models.”  In  my  given  model,  the  scientist  defines  a certain 
mapping  M of  a set  C of  functions  into  a set  f . The  “physical”  functions  are  the  elements  of  C that  are  mapped 
onto  the  set  Ee  of  experimental  results  The  mapping  M and  the  set-say  Ce -of  all  the  “physical”  functions  are 
altogether  called  the  physical  law. 

In  classical  or  quantum  mechanics,  the  mapping  consists  of  the  “kinematics”  and  C is  the  set  of  the  “dynamical” 
functions,  such  as  the  potential. 

In  seismology,  the  mapping  is  usually  defined  through  the  laws  of  propagation  of  elastic  waves.  The  set  C con- 
sists of  several  functions  of  position  such  as  the  density  the  bulk  modulus  K , the  shear  modules  u , the  local 
quality  factor  Q,  which  give  a description  of  the  earth  In  geochemistry,  C would  have  to  include  the  concentra- 
tion of  vanous  elements  or  compounds  as  a function  of  position  and  time.  In  geomagnetism,  it  would  include  electri- 
cal conductivity  as  a function  of  position. 

In  the  diffraction  of  electromagnetic  waves,  C.  would  include  functions  of  position  such  as  the  conductivity  of 
the  diffracting  body,  and,  most  important,  it  would  include  the  geometry  of  the  body-namely,  a boundary  condition 
in  the  wave  equations.  The  mapping  is  defined  th  ough  Maxewll’s  equations. 

The  same  general  schime  holds  in  the  transn  issiem  line  scattering,  but  it  is  distinguished  by  its  one-dimensional 
character  and  the  fact  that  waves  propagate  along  the  line  in  two  directions,  left  to  right  and  right  to  left. 

la  studies  of  extended  radiation  sources,  C includes  the  radial  distribution  of  the  emission  coefficients  (which 
in  turn  can  give  information  on  such  parameters  cs  temperature  and  density  of  the  emitting  matter).  The  mapping  is 
defined  through  the  laws  of  radiative  transfer.  More  generally,  in  radiative  transfer,  as  well  as  in  neutron  transport 
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theory,  the  set  C is  the  set  of  properties  of  a semitransparent  medium  relating  to  the  absorption,  emission,  and 
scattering  of  radiative  fields. 

In  electrocardiology,  C is  the  generator  data,  and  the  mapping  is  defined  through  the  Poisson  equation  and 
the  geometry  of  the  human  torso. 

In  the  mathematical  problem  of  “identification,”  M is  defined  by  a partial  differential  equation,  whereas  C 
enters  as  either  a boundary  condition  or  as  a parameter. 

Throughout  these  physical  or  mathematical  situations,  we  cncountei  the  same  general  scheme.  In  the  following, 
it  is  convenient  to  use  the  mechanical  terminology  and  to  call  C the  class  of  possible  dynamics,  Ce  the  class  of 
effective  dynamics,  M the  kinematics  of  the  problem.  Giving  C and  M , and  defining  the  measurements,  is  giving 
a model.  Now,  let  us  assume  that  we  perfectly  know  the  kinematics  of  a model.  There  are  two  ways  of  using  the 
model  to  increase  our  knowledge  of  the  world.  In  the  direct  problem , the  dynamical  functions  are  known,  either 
from  general  principles,  or  from  previous  information.  The  kinematics  enables  one  to  derive  from  the  dynamical 
functions  a set  of  numbers,  which  should  fit  the  experimental  data.  In  the  inverse  problem,  the  dynamical  functions 
are  derived  from  the  experimental  results  and  can  be  used  in  turn  as  fundamental  information  for  further  investigations. 

The  present  paper  is  devoted  to  a comparative  study  of  the  inverse  problems  in  physics.  The  most  straight- 
forward idea  for  such  a comparison  is  to  establish  a classification  of  the  physical  situations  in  which  an  inverse  prob- 
lem has  been  of  interest.  Suppose  first  we  put  the  examples  quoted  above  in  an  order  depending  on  the  field  con- 
cerned: classical  and  quantum  mechanics,  geophysics,  electromagnetic  theory,  radiative  transfer,  electrocardiology, 
and  mathematics.  Suppose  then  we  refined  our  classification-for  instance,  dividing  geophysics  into  seismology, 
geochemistry,  geomagnetism.  Clearly  the  net  result  of  such  a classification  would  be  to  enhance  the  uniqueness  of 
the  inverse  problems  in  various  fields,  and  to  make  rery  difficult  the  classification  of  certain  methods  such  as  electrical 
sounding  in  geophysics. 

A more  persuasive  v.  ay  of  making  a classification  is  to  describe  the  physical  situations.  In  the  example  quoted 
above,  we  first  encounter  scattering  studies,  in  wltich  the  experimental  results  give  information  on  the  asymptotic 
behavior  of  waves  (quantum  mechanics,  seismology,  electromagnetic  theory).  These  studies  reduce  generally  to 
Sturm-Liouville  problem'.  There  is,  however,  a large  qualitative  difference  between  the  problems  in  which  the  main 
phenomenon  is  a stationaiy  state  of  collision  (quantum  mechanics,  electromagnetic  theory)  and  the  problems  in 
which  progressive  waves  are  dominant  (seismology).  The  second  category  ' ' physical  situations  could  be  called  the 
“transfer  phenomena”  in  which  the  experimental  results  give  information  on  the  intensity  of  a transferred  energy 
(radiative  transfer,  neutron  transport).  A third  category  would  consist  of  “closed  situations”  in  which  the  experi- 
mental measurements  can  be  modified  during  experiments  according  to  the  information  previously  obtained  (electro- 
cardiology,  electrical  and  some  other  soundings).  It  is  clear  that  this  method  of  classification  is  disappointing.  The 
first  category  is  fairly  well  defined  and  could  justify  a study  by  itself.  The  two  other  categories,  although  they  are 
most  important  for  applications,  have  many  chances  to  contain  several  otherwise  disconnected  studies.  For  an  exten- 
sive study  of  inverse  problems,  this  classification  may  be  of  interest.  For  an  introduction  to  interdisciplinary  aspects, 
we  will  keep  it  as  a secondary  way  of  classification,  not  the  main  one. 

A third  way  of  making  a classification  would  be  to  emphasize  the  mathematical  distinction  between  a well-posed 
and  an  improperly  posed  problem.  Since  the  definitions  of  these  words  exhibit  important  fluctuations,  and  are 
generally  too  precise  for  our  purpose,  let  us  say  that  we  use  them  in  the  following  sense:  a well-posed  problem  is  a 
problem  in  which  the  mapping  M of  Ce  into  E is  a bijection,  provided  that  c is  the  class  of  ail  possible  experi- 
mental results  of  a certain  kind.  This  means  that  for  a well-posed  problem,  if  a given  element  of  E , a defined  set  of 
experimental  results,  is  known  exactly,  then  the  dynamical  function  can  be  obtained  completely.  In  practice,  the 
experimental  results  are  necessarily  affected  by  errors,  so  that  even  in  a “well-posed”  problem  the  dynamical  func- 
tions may  be  impossible  to  determine  completely  from  experimental  results,  especially  if  the  available  inversion 
methods  are  not  stable.  The  situation  is  much  worse,  however,  in  an  improperly  posed  problem,  in  which  E is  either 
too  email  a class,  so  that  an  infinity  of  dynamical  functions  correspond  to  an  “exact”  set  of  “experimental  results,” 
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or  too  large  a class,  so  that  certain  elements  of  E do  not  correspond  to  any  dynamics  at  all.  For  example,  in  quantum 
mechanics,  if  6 0(E)  is  a known  function  and  there  is  no  bound  state,  the  inverse  problem  for  8 ■ 0 may  be  well 
posed  in  our  sense,  although  physically  it  is  frivolous  to  speak  of  B0(E)  for  E -*  *>;  if  there  is  no  restriction  on 
the  bound  states,  an  infinity  of  potentials  correspond  to  60(£);  and  we  describe  the  problem  as  improperly  posed 
by  underdetermination ; if  fi  g(£)  is  known  for  all  8 and  all  E,  the  conditions  that  have  to  be  imposed  on  6 g(£f) 
to  ensure  the  existence  of  a solution  are  unknown,  and  the  problem  generally  is  improperly  posed  by  overdetermina- 
tion. Obviously,  improperly  posed  problems  also  may  exist  in  which  E is  neither  smaller  nor  larger  than  the  elass- 
say  E e -that  is  the  image  of  Ce  through  the  mapping  M . The  distinction  between  well-posed  and  improperly 
posed  problems  is  of  practical  importance  but  it  clearly  cannot  be  the  main  motivation  of  a classification. 

Ultimately,  the  most  practical  way  of  making  a classification  of  the  interdisciplinary  aspects  of  inverse  problems 
is  to  study  separately  the  various  steps  through  which  the  attempts  of  solution  of  such  a problem  may  proceed.  After 
each  step,  new  questions  are  asked  to  increase  the  knowledge  of  the  problem,  and  the  following  steps  attempt  to  give 
them  an  answer.  Since  the  questions  are  more  and  more  refined,  it  is  not  surprising  that  the  logical  order  of  this 
“quest  for  information”  is  also,  in  many  cases,  the  historical  order  of  the  studies  of  the  problem.  First  came  the 
approximate  methods,  followed  by  the  trial-and-error  methods.  Then,  generally,  authors  realize  that  the  solutions 
are  not  unique,  and  ask  a set  of  deep  questions  such  as  existence,  uniqueness,  constructibility,  approximation 
theory,  stability.  Then  the  incomplete  exact  solutions  come,  in  which  subsets  cf  E or  C0  are  used.  The  inci- 
dence of  experimental  errors  and  the  statistical  convergence  of  the  systems  identifications  are  studied  apart.  Ulti- 
mately, the  complete  solutions,  the  approximation  theories,  and,  of  course,  the  mathematical  generalizations  come 
to  prove  that  the  problem  is  now  a “classical  problem.”  Obviously,  there  is  no  real  problem  that  has  gone  through 
the  full  cycle  of  evolution  outlined  here.  Studies  of  problems  concerned  primarily  with  fundamental  physics 
(e.g.,  quantum  mechanics)  went  through  a large  part  of  the  cycle,  but  did  not  pay  much  attention  to  the 
question  of  experimental  errors.  On  the  other  hand,  studies  concerned  with  applied  physics  (e.g.,  the  various 
soundings)  paid  most  at'-ntion  to  the  experimental  errors  and  did  not  develop  vsry  interesting  exact  methods. 

In  this  introduction  to  the  interdisciplinary  aspects  of  inverse  problems,  we  outline  the  various  steps  of 
their  evolution,  with  emphasis  on  the  structures  of  the  methods  and  the  quantity  of  information  referred  to. 
Specialized  methods  of  each  discipline  are  minimized  as  a result.  The  similiarities  of  the  inverse  problems  in 
the  various  fields  are  emphasized  while  their  differences  are  left  to  more  detailed  studies. 


Examples  are  given  of  the  problems  noted  earlier,  particularly  those  in  quantum  mechanics  and  seismology, 
which  exhibit  almost  all  aspects  of  the  cycle  of  evolution.  Another  reason  for  this  emphasis  is  even  better:  the 
competence  of  the  author  rapidly  decreases  from  quantum  mechanics  to  clectrocardiology! 


MATHEMATICAL  NOTATIONS 


To  summarize  a physical  model,  we  use  the  notation  m with  an  index  to  denote  the  model,  V with  an  index 
to  denote  a particular  dynamics,  and  E with  an  index  to  denote  a particular  experimental  result.  A typical  direct 
problem  is  therefore  schematized  as 

(«o):  Vo(Vo€C)-+(U)-+Eo(E0ZE)  (1) 


and  a typical  inverse  problem  as 


94 


(m):  E0(Eoe  Ee)^V^V{€  Ce) 


(2) 


« 


In  all  the  cases  we  study,  it  is  possible  to  define  C and  E as  complete  metric  spaces.  The  direct  oblem  is  formu- 
lated by 


M(V)  = E VeC  EeE 


(3) 


Let  Eg  be  a subset  of  E , which  can  be  associated  with  the  set  of  all  possible  experime-ta!  results  of  a certain  type 
(for  instance,  all  the  functions  6 g(£)  continuous  and  going  to  zero  as  E goes  to  *>).  *ne  inverse  problem  is  the 
problem  of  obtaining  the  solutions  of  Eq.  (3)  in  C for  all  E 6 Eg  • The  problem  is  well  posed  (in  our  sense),  if  there 
is  one,  and  only  one,  solution  for  each  element  E of  Eg . Notice  also  that  our  definition  is  essentially  identical  to 
the  one  introduced  by  Hadamard  [1902]  of  a "probleme  parfaitement  bien  pose’,  je  veux  dire  possible  et  determine  .** 
It  is  only  more  recently  that  a third  condition  was  introduced  to  define  a properly  posed  problem  in  Had'  nard's 
sense,  namely,  that  the  solution  of  Eq.  (3)  depends  continuously  on  E [Lavrentiev,  1967] . This  condition  is 
essential  for  stability. 


APPROXIMATE  METHODS 


Classical  Methods 

In  a number  of  wave  propagation  problems,  the  variations  of  the  dynamical  functions  over  a given  wavelength 
are  so  small  that  diffraction  and  reflection  can  be  neglected.  Then  wave  groups  propagate  along  classical  trajectories 
to  the  next  discontinuity  surface;  the  parts  of  classical  trajectories  between  two  discontinuity  surfaces  are  rays . These 
rays  obey  Fermat’s  principle:  among  possible  paths  joining  any  two  assigned  points  in  the  medium,  the  actual  ray  is 
such  that  the  travel  time  along  it  is  extremal.  If  the  radius  of  curvature  of  the  surface  discontinuities  is  sufficiently 
large,  diffraction  can  also  be  neglected  to  study  a discontinuity,  which  gives  rise  to  a reflected  and  a transmitted 
wave,  both  propagating  along  rays,  so  that  a completely  classical  model  can  be  used  in  the  medium. 

Suppose  now  the  problem  has  either  spherical  or  cylindrical  symmetry,  and  is  a realization  of  one  of  the  two 
following  (equivalent)  situations:  the  paths  of  propagation  have  turning  points  in  the  medium  (quantum  mechanics, 
seismology),  or  the  source  of  radiation  is  an  extended  ball  (or  an  extended  cylinder).  Then  it  is  possible  to  relate 
measurable  quantities  to  the  dynamical  functions  by  an  Abel’s  integral  equation. 

Examples  of  such  a result  are  shown  in  the  inverse  scattering  problem  at  fixed  energy  in  classical  mechanics 
[Keller  etal.,  1956]  and  quantum  mechanics  [Firsov,  1953;  Wheeler,  1955;  Sabatier,  1966a  ; Vollmer,  1969; 

Miller,  1969] , where  the  formula 


( r / 

i+rr)  X//(X)dX 

/ 1 X1  - 

(4) 

v 2/  _ 

2 

obtained  through  the  JWKB  approximation,  is  typical.  In  Eq.  (4),  provided  the  potential  allows  only  one  “turning 
point,”  H(K)  is  bijectivdy  related  to  the  part  of  the  potential  V(r)  beyond  the  distance  of  closest  approach  rQ ; 

6 g is  the  phase  shift,  as  a function  of  the  quantized  angular  momentum  8 , and  can  be  constructed,  in  certain  con- 
ditions, from  the  experimental  results. 

A more  useful  example  is  the  study  of  extended  radiation  sources  where  external  measurements  of  the  radiance 
produced  by  ihe  source  are  used  to  deduce  the  radial  distribution  of  the  emission  coefficient  [Nestor  and  Olsen,  1960; 
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freeman  and  Katz,  1960 ,Bockasten,  1 96 1 ; Harr,  1962, Heriitz,  1963;  Maldonado  et  at.,  1965;  Oemers  and  Birkeback, 
1966;  AGnerbo  and  Levy,  1969].  Let  us  denote  by  Y(y)  the  measurement  of  the  ridiance  emitted  by  the  extended 
radiation  source  along  the  edge  of  a circular  slab  of  center  0,  in  a cross  section  taken  perpendicular  to  the  axis  of 
symmetry  Oz  (Ox  and  Oy  being  with  Oz  a rectangular  triedron).  Let  R(r)  be  the  emission  coefficient,  which  is 
assumed  to  vanish  beyond  a finite  radius  a.  Then 

Y(y)  * 2 f (r*  - y1  )_l /l  R (r)r  dr  (5) 


A still  more  remarkable  example  of  this  type  of  result  is  shown  in  seismolo^- , where  the  time  T of  travel  along  a ray 
can  be  determined,  as  a function  of  the  ray  parameter  p . by 

T(p)  « I *(T7)(tf‘  -/>*)-*'*  n dr?  (o) 

JP 


In  Eq.  (6),  rjc  is  equal  to  r0/v(r0),  where  r0  is  the  eartn  radius,  v(r)  is  the  speed  of  propagation  of  the  wave  along 
the  ray,  at  the  Doint  with  radial  coordinate  r , and  |(i?)  is  a function  related  bijectively  to  v(r),  provided  that  r/v 
decreases  monotonely  from  i)0  to  p as  r decreases  from  r0  to  say,  tp(tp  > r'  > 0).  The  problem  is  to  recover 
£(t?)  from  T(p)  (Hergton,  1907;  Wir chert.  1907; Bateman,  1910  Slichter,  1932 ,Bullen,  1956]. 

Integral  transforms  similar  to  Eqs.  (4),  (5),  and  (6)  also  appear  in  the  analysis  of  vertical  incidence  radio  sound- 
ings of  the  ionosphere  when  the  JWKB  approximation  is  used.  The  relations  between  the  group  time  delay  of  a pulse, 
or  its  transit  time  between  two  points,  and  a function  directly  related  to  the  electron  density  are  of  this  form, 
[Rydbetk,  1942;  Kay,  1971],  provided  that  the  last  function  is  monotonic,  which  ensures  the  uniqueness  of  the 
turning  point.  Again,  a similar  transform  was  given  by  Sh:  jys  and  Pimglla  { 197 1 ] , in  their  studies  of  nonradially 
stratified  media  by  trajectory  methods.  In  diagnosing  a two-dimensional  refractive  index  distribution  n(r,  6),  ex- 
pandable in  Fourier  series  with  coefficients  Nk(r),  they  prove  that  the  Fourier  component  lk(b)  of  the  phase  shift 
of  a ray,  characterized  by  its  impt-'t  parameter  b and  its  angle  ft,  is  related  to  Nk(r)  by 


Ik(b)  - J bNk(-~jeW( tinfi)-*  d0  (7) 

which,  for  k * 0 and  k ■ 1 reduces  to  a transform  similar  to  Eqs.  (4),  (5),  or  (6).  Oth'  r examples  can  be  found 
in  iriany  cases  where  a classical  approximation  is  used  in  a physical  problem.  The  simplest  one  is  orobably  the  his- 
torical Abel’s  problem:  determine  the  shape  of  a hill,  given  the  time  taken  by  a particle  to  go  up  kv3  return  as  a func- 
tion of  the  particle's  initial  velocity  [Abel,  1826] . 

The  formulas  (4),  (5),  and  (6)  can  be  considered  as  integral  equations  for  the  dynamical  functions  ]resp.  H( A). 
R(r),  {(»?)]  when  the  “experimental  fundior:”  [reap.  6g,  Y(y),  T(p)]  are  known.  The  integral  equation  is  of 
Abel's  type,  and  can  be  solved  exactly  if  6(8  ♦ 1/2),  T(y).  T(p),  are  known  differtntiable  functions!  For  instance, 
the  exact  inverse  of  Eq.  (5)  is 
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R(r)  = -it'1  f 0*  - r3)  m Y '(y)  d y 


Similar  formulas  hold  in  the  other  cases. 

Analysts  of  the  inverse  problem-  To  use  the  above  method,  we  first  must  assume  that  the  result  to  be  found 
makes  the  approximation  passible.  This  is  not  a trivial  assumption.  In  quantum  mechanics,  it  ir.-ans  that  we  have 
to  exclude  the  possibility  of  “orbiting.”  In  seismology  , it  means  that  v(r)  has  to  increase  steadily  as  r decreases. 

Now,  it  is  possible  to  weaken  this  assumption-including,  for  example,  special  velocity  variations  ( Builen,  l°5o, 

1963]  - without  implying  any  great  mathematical  difficulties,  but  only  a small  number  of  such  compilations  can 
be  taken  into  account.  In  any  case,  from  t!'.3  mathematical  point  of  view,  the  class  of  functions  in  which  the  solution 
of  the  problem  is  desired  is  not  very  well  defined . 

A second  difficulty  comes  from  the  experimental  errors  and  fluctuations.  It  fellows  from  them  that  the  exper- 
imental function  is  not  known  as  a differentiable  function.  Actually,  the  integral  transform  Eq.  (6)  is  equivalent  to 
half-order  differentiation  [Gorenflo  and  Kovetz,  1966] , and  therefore  noise  amplification  is  unavoidable.  However, 
there  has  been  no  complete  study  of  this  instability  up  to  now.  Several  authors  have  proposed  numerical  methods  to 
solve  the  Abel's  equation  using  Gaussian  integration,  interpolation  formulas  of  low  order  [Nestor  and  Olsen,  1960; 

FHe,  1963;  Bockasten,  1961 ; Edels  et  aL,  1962] , approximations  of  the  data  by  smooth  functions  [Freeman  and 
Katz,  1960;  Oemers  and Birkebck,  1966 \BuUen,  1961, 1964] , orthogonal  function  expansions  [Hr.rlitz,  1963; 
Maldonado  et  aL,  1965;  Gorenflo  andKovetz,  1966; Minerbo  and  Levy,  1969]. 

A third  difficulty  comes  in  quantum  mechanics.  In  the  other  cases,  if  convenient  assumptions  are  taken,  the 
problem  is  well  posed.  In  quantum  mechanics,  at  the  best,  only  integer  8 values  can  be  taken  into  account.  The 
net  result  is  that,  even  if  the  experimental  errors  were  canceled,  an  interpolation  woiud  be  necessary,  it  follows  from 
Loeffel  (1968]  that  the  choice  of  the  interpolation  makes  the  problem  well  posed  in  the  class  U of  potentials  whose 

jT*  p I V{p)  I dp  is  finite.  Unfortunately,  the  choice  of  an  interpolation  consistent  with  J WKB  approximation  is 

difficult  and  in  no  case  unique  [Sabatier,  1966a] . The  uniqueness  appearing  in  Eq.  (6)  is  misleading  because  the 
nonuniqueness  appears  in  the  step  5g  -*  5(2  + 1/2). 

Extensions  and  improvements  of  the  results-  In  quantum  mechanics  and  in  seismology,  as  well  as  in  the 
transmission  line  scattering,  the  formulas  quoted  above  are  obtained  through  the  JWKB  approximation.  This  approxi- 
mation can  be  considered  as  the  first  term  of  a series  of  powers  of  the  reflection  coefficient  [Bremmer,  1951]  or  similar 
paiameters.  Rather  surprisingly,  taking  into  account  the  next  order  of  the  approximation  again  yields  an  Abel’s 
integral  equation  [Sabatier,  1966a;  Vollmer,  1969] . The  JWKB  approximation  and  a fust-order  perturbation  can 
also  be  combined;  leading  to  an  inversion  method  where  both  smooth  and  sudden  (but  small)  variations  of  the 
dynamics  are  taken  into  account. 

Other  applications  of  classical  methods  to  inverse  problems-  The  ray  method  is  commonly  used  for  diagnosis 
in  electromapietic  scattering,  but  its  widest  field  of  application  is  seismology.  It  has  been  used  in  almost  all  the 
determinations  of  earthquake  parameters.  Builen  [1963]  provides  a comprehensive  review  of  tnese  applications. 

More  recent  theoretical  work  is  found  in  Gerver  tmd  Markushevich  [1966, 1967] . It  is  interesting  to  note  that  the 
ray  method  led  Mohorovicic,  in  1909,  to  detect  the  famous  discontinuity  below  the  surface  of  the  earth  that  has 
been  named  after  him. 

Usually,  the  most  important  experimental  measurement  used  in  the  method  is  the  arrival  time  of  the  wave 
although  many  other  parameters,  such  as  dispersion  curves  and  amplitudes,  can  be  taken  into  account.  However, 
the  filtering  effect  of  the  earth,  together  with  the  random  distribution  of  the  shells  in  the  crust,  prevent  precise 
analyses  of  the  spectra.  On  the  other  hand,  under  certain  conditions,  large  deformations  of  the  Mohorovicic 
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discontinuity  can  yield  converging  effects  through  which  the  amplitude  measurements  allow  an  easy  diagnosis 
[Beau fils,  et  aL,  1970] . 


Perturbation  Methods 

Let  us  assume  that  we  know  a certain  exactly  solvable  model  with  which,  for  a given  dynamic  VQ , depend- 
ing on  a few  parameters,  we  can  solve  completely  the  direct  problem 


Vo  - (M)  - Eo 


(9) 


Suppose  now  VQ  is  given  a perturbation  X V,  and  let  AE  be  the  modification  of  the  e\perimental  result.  It  may 
happen  that  AE  can  be  expanded  in  powers  of  X: 


AE  = X A0(K)  + X2A,(F)  + • • • (10) 

where  Ao(K),  A,(F).  are  certain  functionals,  which  also  depend  on  VQ.  A “perturbation  method”  is  possible  if 
I AE  - X Ao  V i is  much  smaller  than  AE  as  AE  goes  to  zero.  Then,  for  reasonably  small  deviations  to  the  solvable 
scheme  in  Eq.  (9),  it  is  possible  to  study  the  direct  problem  through  replacing  AE  by  XAqK  In  addition,  higher 
order  of  the  perturbations  expansions  can  be  calculated  in  certain  cases.  The  inverse  problem  can  be  handled  if  AqV 
is  inveisible.  Once  a value  K,  has  been  obtained  from  the  measured  AE,  it  is  possible  to  use  the  expansion  in  Eq.  (10) 
or  an  iteration  process  to  improve  the  precision. 

The  perturbation  method  has  been  used  in  several  inverse  problems.  In  quantum  mechanics,  it  yields  the  essen- 
tial part  of  the  methods  of  Froberg  [1947,  i948]  and  HyUenua  [1948]  for  inverse  problem  at  fixed  £;  it  has  also 
been  used  as  a first  approach  toward  the  approximation  problem  [Sabatier,  1967a]  in  the  inverse  problem  at  fixed 
energy.  In  seismology,  it  has  been  studied  by  Knopoff  [1960, 1961, 1962]  i i several  type  of  problems  [see  also  Mai 
and  Knopoff,  1968,  and  the  quoted  references] , including  a study  of  higher  orders  of  the  perturbation  expansion 
[Knopoff,  1962] . Applications  to  the  inverse  problems  for  surface  waves  have  also  been  given  by  Jobert  [19f0, 

1970] , and  to  the  inverse  eigenvalue  problems  in  seismology  by  Vareault  [1965] . 

Let  us  now  try  to  analyze  more  thoroughly  the  structure  of  perturbation  methods  in  (he  inverse  problems. 

(Such  an  analysis  would  have  avoided  well-known  errors  in  some  cases.)  In  typical  problems,  C and  E can  be  given 
the  structure  of  normed  linear  spaces,  and  M is  a differentiable  mapping  of  C into  E , namely,  a Frechet  differen- 
tiable functional.  Then,  if  VQ  and  V are  elements  of  C,  we  can  find  a linear  continuous  functional  6 M0(V)  such 
that 


AM  = M(V0  + V)  - M(Fc>)  = SM0(V)  + r0(V)  (11) 

where  r0(V)  is  a functional  whose  norm  is  infinitely  small  compared  to  I V I as  I V I goes  to  zero: 

(Ke)(e>0)  (J8)(S  > 0)(IKl  < 5)(lr0(K)  Ke  IKI)  (12) 

where  I - I denotes  the  norm  in  C and  I-  I the  norm  in  E.  Usually,  C is  a space  of  integrable  functions  and  the 
linear  function  8 Mo(V)  can  be  put  in  the  integral  form 
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l««o(K)]a  = /F(p)U(a,p)dp  = Ea  (13) 

where  u(a,  p)  is  a known  function  of  a and  p,  and  a is  a parameter  related  to  the  structure  of  E As  an 
example,  in  quantum  mechanics,  at  a given  energy,  in  the  (first-order)  Bom  approximation 


F (t>)  J g+ 1 12  (p)-f £+1/2  (p)p  d P 


(14) 


In  that  case,  a is  a discrete  parameter  and  E is  a sequence  space. 

Let  us  now  study  the  mverse  problem  in  the  first-order  perturbation  theory.  Let  us  assume  that  the  experimental 
function  £(<*)  is  perfectly  known  Then  Eq.  (13)  is  a generalized  moment  problem,  which  in  most  inverse  problems 
has  at  least  a solution.  New  let  C,  be  a space  of  functions  in  which  the  g.m.  problem  has  one  solution  only.  Clearly, 
if  the  solution  of  Eq.  (1 3)  is  sought  in  a set  larger  than  Ct , the  problem  is  undetermined.  This  is  the  case  for  Eq. 
(14)  if  F(p)  is  sought-say  in  La(0,  «>).  Actually,  with  the  usual  limitations  on  the  6g,  a unique  solution  could  be 
obtained  in  the  subspace  of  L2  (0,  °°)  containing  only  entire  functions  of  order  1 and  type  2 [Sabatier  1967a] . The 
conditions  on  u(a,  p)  for  which  Eq.  (13)  has  a unique  solution  in  a sufficiently  large  space  of  functions  are  generally 
not  met  in  the  physical  inverse  problems.  The  nonrealization  of  that  point  led  Froberg  [1948]  and  Hylleraas  [1948] 
to  an  incorrect  guess  [Bargnumn,  1949;  Levinson,  1949 \Hoimberg,  1952]. 

Let  us  denote  bj  N j a linear  functional  that  is  an  inverse  of  SM0,  with  norm  N, . Let  us  assume  that 
r0(V)  is  lipschitzian  in  a ball  (0, 5"),  and  that  the  product  L of  Nt  by  the  Lipschitz  constant  is  smaller  than  1 . 
(The  first  assumption  is  generally  true  in  physical  problem;  the  second  one  depends  on  N , .)  Then  the  mapping 
Ni  is  contracting  in  die  ball  of  center  O,  radius  6'  = 5"(1  - L): 

« W,  MF,)]  - N,  [r0(F2)]  \<  L 1 F,  - V2  I L < 1 05) 

To  obtain  a solution  of  AM  = E,  we  now  solve  F = N,[£-r0(F)]  by  iteration  through  the  fixed-point 
construction  procedure: 

F0)  = N,(L) 

F<,+1>  = N,(E)~  N,[r0  (f<0)]  (16) 

Clearly  , the  inequality  in  Eq.  (15)  and  a well-known  argument  [see,  for  example,  Shilov,  1965]  enable  one  to 
guarantee  that  if  I F\1  ) I < 5' , the  sequence  of  Eq.  (16)  converges  toward  a unique  solution  F.  Moreover,  Eq. 

(14)  is  a constructive  method:  after  a finite  number  of  steps,  it  yields  a result  F(0  such  that  II  F - V(0  S can  be 
bounded,  by  L,+l  (1  - L)-1.  Therefore,  to  each  inverse  of  the  linear  functional  6 M corresponds  one  in- 
verse and  one  only  of  the  functional  M such  that  I h/f(E)  I is  smaller  than  a certain  quantity.  Therefore,  the 
differential  5 M yields  a way  to  study  the  nonuniqueness  of  the  inverse  problem,  but  some  other  solutions  may 
escape  it.  Notice,  in  particular,  that,  8 Mo  being  linear,  either  N,  is  unique,  or  there  exist  an  infinity  of  them,  and, 
among  them,  some  with  a norm  so  large  that  the  method  above  fails.  It  also  yields  a way  to  study  the  approximate  i 
problem  and  the  fluctuations  due  to  experimental  errors,  but  again  with  the  same  limitation.  From  the  theoretical 
point  of  view  it  is  good  for  a first  study  but  not  sufficient  for  a complete  one. 

From  the  practical  point  of  view,  the  use  of  a perturbation  method  has  the  same  defects  as  classical  methods. 

It  requires  a strong  a priori  assumption,  and  experimental  errors  are  still  a cumbersome  complication. 


n. 
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Construction  of  perturbation  methods - In  most  cases,  the  inverse  problem  is  defined  by  a linear  partial 
differential  equation,  and  a certain  set  B of  boundary  conditions.  Assume  that  the  differential  equation  is  of  the 
form: 


D0(  r)X  = p(r)X 


(17) 


where  D0(r)  is  a well-known  differential  operator  in  Rn , such  that  the  solutions  of 


D0{r)X0  = 0 


(18) 


with  the  boundary  conditions  B,  are  well  known,  and  p(r)  contains  the  part  of  the  dynamics  which  can  be  treated 
as  a perturbation.  Assume  also  that  the  Green's  function  corresponding  to  Z)0(r)  and  8 is  well  known: 

Da(r)  G(r,r)  = 6(r,r)  (19) 


Then  X is  a solution  ot  the  integral  equation: 

X(r)  = X0(r)  + Jc(r,p)p(p)X(p)dp  (20) 

If  the  Neuman  series  for  Eq.  (20)  converges,  it  yields  a “perturbations  expansion”  for  X(r).  Tht  convergence  of  this 
series,  however,  is  not  always  necessary  to  get  a good  approximation  of  X(r)  by  the  first-order  (the  Born 
approximation): 


*>(r)  = X0(r)  + J G{t,p)p[p)Xo{p)ip  (21) 

Now,  to  make  a measurement  M{  of  X(r),  we  obtain  the  values  of 

$M/(»)-X(r)dr  (22) 

where  Mi(r)  is  defined  from  the  experimental  measurement,  with  the  linear  approximation  in  Eq.  (21)  in  place  of 
X(f);  therefore,  we  get  numbers  of  the  form  in  Eq.  (13). 


NUMERICAL  METHODS 


The  discussion  of  numerical  methods  logically  should  follow  a complete  analytic  study  of  the  mathematical 
problem.  However,  since  the  purpose  of  a physicist  is  to  obtain  results,  numerical  methods  come  in  the  study  of 
inverse  problems  as  soon  as  a constructive  method  is  available,  either  exact,  or  approximate.  One  might  question  the 
relevance  of  numerical  methods  in  a study  of  inverse  problems.  After  all,  most  numerical  methods  are  nothing  but 
applications  of  the  methods  of  solution  of  the  direct  problem  to  various  dynamical  functions,  comparison  with  the 
experimental  results,  and  choice  of  the  best  approximation.  But  the  whole  development  of  physics  goes  this  way; 
we  cannot  encompass  in  our  study  the  whole  of  physics!  On  the  other  hand,  many  analytic  methods  to  construct  a 
solution  of  inverse  problems  go  also  this  way,  since  they  proceed  by  successive  approximations.  In  short,  if  we  do 
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not  include  in  our  study  all  the  fitting  procedures,  we  have  to  make  a distinction  between  the  numerical  computa- 
tions according  to  the  nature  of  the  information  Jiey  are  able  to  extract  from  the  experimental  results,  not  accord- 
ing to  tiie  very  structure  of  the  method.  Such  a distinction  is  very  arbitrary. 

Th:  ughout  the  papers  devoted  to  numerical  computations  to  fit  the  experimental  results  with  the  help  of  trial 
dynami  r i'  functions,  we  see  two  kinds  of  studies,  through  wnich,  so  to  speak,  the  position  of  the  guess  is  moving. 


Completely  Numerical  Computations 

These  are  methods  in  which  the  mathematical  transcription  of  the  physical  processes  is  handled  on  the  com- 
puter from  the  very  beginning.  For  each  step  of  the  physical  problem,  there  is  a corresponding  step  of  the  program. 
The  direct  problem  is  solved  in  this  way,  for  several  possible  dynamics,  among  which  the  program  enables  one  to 
choose, : : as  to  get  the  optimal  fitting  of  the  experimental  results. 

L'  us  take  for  example  a one-dimensional  process  where  a (partially)  measurable  quantity  U is  related  to  a 
dynamic  d function  a of  the  medium,  to  be  defined  for  x € [0,  j ] , by 


D(a,U)  = 0 


(23) 


and  boundary  conditions  on  certain  components  of  U,  say  0)  and  The  inverse  problem  is  to 

infer  tlie  function  e(x)  from  measurements  of  a component  of  U at  various  points 


U0(x{)  = w,-  i = 1,2, . . . ,Af 


(24) 


The  oroblein  can  be  discretized  by  introducing,  to  cover  the  medium,  N homogeneous  sections  (fe,-,fcl+ j ),  where 
the 1 dues  m of  the  desired  dynamical  function  are  selected  so  as  to  minimize  the  sum  of  the  square  of  the 
deviations: 


M 

S = 7!  [£/0(xf)  - w,]J  (25) 

i=l 


In  dynamic  programming  the  minimization  is  done  at  each  step,  through  a function 


fK{c>,c%)  = ^ [«(*,■)  - w,]  2 (26) 

f=l 


The  equation  D(aj , U)  is  curved  inside  each  section  (fy,fy+  j),  for  / = 1, 2, . . . ,K,  with  the  boundary  conditions 
Cj  and  Cj  (which,  ultimately,  for  K -N,  should  fit  0)  and  C/^'(h^r+i)],  and  with  appropriate  continuity 
requirements  of  U at  the  boundaries  of  the  sections.  The  minimization  of  is  done  on  the  coefficients  a,-  at 
each  step  , r d is  the  number  of  measurements  on  the  first  K intervals,  with  K = 1 , 2, . . . , N.  Sequences 
of  function  .s  are  thus  obtained,  which  solve  the  problem.  The  above  way  of  proceeding  is  convenient  for  inverse 
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problems  of  transport  theory  [Bellman  et  al.,  1967],  diffusion  processes,  or  radiative  transfer,  when  the 
measurements  are  done  all  along  the  physical  processes. 

In  most  cases,  however,  the  measurements  are  made  only  at  a boundary  of  the  medium.  Then,  if  we  call  “trial 
results”  the  images  of  the  trial  dynamical  functions  in  the  mapping  M , we  see  that  the  direct  problem  has  to  be 
solved  completely  for  every  trial  result,  and  the  deviation  of  these  results  from  the  observed  ones  has  to  be  computed. 
The  trial  functions  in  be  investigated  in  various  ways. 

In  the  “Monte  Carlo”  method,  first  a procedure  is  introduced  to  generate  random  dynamical  functions.  Next, 
it  is  necessary  to  specify  a set  of  tests,  whether  direct  or  indirect,  that  must  be  satisfied  by  the  functions  to  be  chosen. 
Direct  tests  are  a priori  properties  or  conditions  (e.g.,  bounds)  on  the  functions.  Indirect  tests  are  essentially  com- 
parisons of  the  trial  results  and  the  experimental  results.  In  these  comparisons,  the  allowed  differences-say  6 
should  be  smaller  than  the  errors  (roundoff  or  discretization)  coming  from  the  numerical  computations.  In  addition, 
the  test  is  precise  if  the  differences  between  the  bounds  defining  the  experimental  data  are  smaller  than  the  6,  . Now, 
a random  dynamical  function  belonging  to  the  class  defined  by  the  direct  tests  is  accepted  if  it  satisfies  all  the  indirect 
tests.  Clearly,  the  great  interest  of  this  method  is  that  it  works  as  well  for  an  underdetermined  problem  and  for  a well- 
posed  problem.  Even  more  important,  this  method  is  able  to  give  nonprogrammed  information:  if  the  totality  of  the 
tests  contain  more  information,  about  the  dynamical  functions  and  certain  indirect  tests  than  the  direct  tests  and  the 
experimental  error  bounds  indicate,  then  at  the  end  of  an  application  of  the  method  to  a suitably  large  number  of 
trial  functions,  the  trial  functions  and/or  the  trial  results  corresponding  to  the  nonprecise  tests  will  lie  in  clearly  dis- 
cernible subregions  in  C . Such  a method  is  particularly  suitable  for  the  (widely  underdetermined)  inverse  problems 
of  geophysics  [Asbel,  et  al.,  1966;  Levshin  etal.,  1966;  Kalis-Borok  and  Yanovskaya  et  al.,  1967;  Yanovskaya,  1963; 
Press,  196&-,  Anderssen  and  Seneta,  1970] . However,  the  method  possesses  also  a great  disadvantage,  for  the  results 
of  the  trials  already  made  are  not  used  in  the  next  trial.  In  the  neighborhood  of  the  best  fit,  if  the  “best  fits"  are 
isolated  in  C,  a direct  optimization  method  can  save  time,  so  that  one  ought  to  combine  the  two  procedures.  It  is 
also  possible  to  introduce  an  estimation  procedure  that  formalizes  the  procedure  for  implementing  and  interpreting 
the  results  obtained  [Anderssen  and  Seneta,  1970] . 

In  a well-posed  problem,  a systematic  method  of  optimization  is  generally  preferable  to  a random  method. 
Usually,  the  trial  dynamical  functions  are  chosen  in  classes  described  by  a few  parameters  and  the  optimization  is 
done  on  these  parameters,  a great  variety  of  classical  numerical  methods  being  used  to  solve  the  direct  problem  of 
going  from  the  trial  functions  to  the  trial  results.  See  Hodgson  [1963]  for  nuclear  physics;  Keilis-Borok  and  Yanov- 
ska'-a  [1967]  for  geophysics;  Kagiwada  and Kalaba,  [1968 ], Bellman  etal.,  [1963, 1965, 1967, 1968]  for  radia- 
tive transfer  and  diffusion  processes;  Gelemterand  Swihart  [1964]  for  electrocardiology;  and  Chavent  [1970]  for 
the  general  mathematical  problem.  Qearly,  the  advantage  of  choosing  the  trial  dynamical  functions  inside  a class 
depending  on  a few  parameters  only  is  that  first  the  problem  becomes  well  posed,  and  also  much  computer  time 
can  be  saved.  Unfortunately,  the  price  to  be  paid  is  very  high:  a method  constructed  in  this  way  can  give  only 
biased  information,  severely  restricted  by  the  class  of  trial  functions.  A good  example  is  given  by  the  numerical 
computations  made  in  nuclear  physics  to  fit  the  experimental  cross  sections  by  means  of  an  optical  potential.  A 
three-parameter  Woods-Saxon  potential  is  generally  used.  Experimentalists  are  quick  to  note  that  even  if  all  the 
possible  data  (differential  cross  section  and  polarization)  are  available,  there  are  still  some  small  ambiguities.  They 
usually  ascribed  these  small  ambiguities  to  experimental  uncertainties,  and  took  for  granted  that  changing  the  form 
of  the  trial  functions  would  not  make  drastic  modifications.  However,  for  certain  classes  of  trial  functions,  there 
can  be  drastic  modifications  [Sabatier  and  Quyen  Van  Phu,  1971]. 

Now  we  see  that,  in  all  these  numerical  methods,  there  is  somewhere  a guess  on  the  dynamical  functions.  To 
apply  any  of  them,  and  particularly  in  dynamic  programming,  it  is  necessary  to  assume  that  the  problem  can  be  dis- 
cretized. This  implies  a “sufficient  continuity”  of  the  functions.  Besides,  in  most  cases,  cheap  pa-ametrizations 
impose  further  restrictions.  The  method  with  the  largest  number  of  “degrees  of  freedom”  would  probably  be  the 
Monte  Carlo  method,  if  only  an  infinite  computer  time  could  be  financially  supported. 
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Reduced  Numerical  Computations 


These  are  methods  in  which  a preliminary  step  has  reduced  the  problem  to  a simply  defined  mathematical  one, 
usually  an  integral  equation.  In  some  cases,  the  preliminary  step  is  done  by  approximate  methods,  hence  introduc- 
ing what  can  be  called  a badly  defined,  or  a “hidden”  guess.  In  many  other  cases,  the  preliminary  step  is  exact. 

Then,  the  good  definition  of  the  second  step  makes  it  possible  to  study  the  uniqueness  or  the  stability  of  the  solu- 
tions much  more  fruitfully  than  in  the  completely  numerical  computations. 

Examples  of  this  type  of  approach  are  shown  in  radiation  measurements,  where  a Fredholm  equation  of  the 
first  kind  relates  the  unknown  function  fiy)  to  the  measured  function  g(x) 

fb  K(x,y)f(y)Ay  - g(x)  (27) 

* a 


Such  an  equation  has  notoriously  unpleasant  mathematical  properties.  First,  it  can  easily  be  seen  that  no  solution 
exists,  for  a given  K(x  y),  if  ;(r)  is  an  arbitrary  continuous  function.  For  example,  one  may  check  that  if  the 
function  x •*  K(xy)  satisfies  a linear  differential  equation,  with  coefficients  depending  on  x only,  g(x)  must 
also  satisfy  this  equation.  If  A'(xjO  is  a degenerate  kernel,  equal  to^^r(x)Br(y),  no  solution  can  be  found 
unless  ;(x)  is  a combination  of  the  functions  Ar{x).  Now,  if  the  eigenfunctions  of  the  kernel  are  known: 


fb  K(xy)fr(y)  Ay  = \rfr{x) 


(28) 


if  no  eigenvalue  is  zero,  and  if  the  eigenfunctions  form  a complete  set  in  the  space  E,  so  that  g(x)  =^brfr(x) , then 
fix)  ■ s.a,fr(x),  where  ar  - brl\r.  This  does  not  work  for  a degenerate  kernel,  which  has  at  least  one  zero 
eigenvalue.  Practically,  these  difficulties  generate  a lack  of  stability  of  the  “approximate”  solution  with  regards  to 
the  right-hand  side  of  Eq.  (27).  There  are  several  ways  of  reducing  Eq.  (27)  to  a matrix  equation 


KfM 


(29) 


for  instance,  by  numerical  quadrature  of  the  left-hand  side  of  Eq.  (27).  But  if  the  right-hand  side  is  intended  to  be 
treated  exactly,  and  if  the  number  of  points  in  the  quadrature  formula  is  increased,  ill  conditioning  and  instability 
will  be  the  general  lot.  On  the  other  hand,  the  problem  of  finding  a smooth  function  fix)  for  which  the  remainder 

«(*)  - fb  *(xjO  fiy)  Ay  - gix)  (30) 

• a 


is  in  some  sense  small,  and  is  well  posed  and  physically  meaningful.  The  smoothing  can  be  done  using  some  a priori 
knowledge  of  the  unknown.  Several  methods  have  been  used  in  this  framework  for  inferring  temperature  profiles 
from  stimulated  emission  measurements  [Mazuel,  1968;  Westwater,  1970] . In  more  recent  studies,  the  minimiza- 
tion of  e(x)  is  understood  as  a minimization  over  a statistical  ensemble.  In  the  linear  statistical  method  of  Strand 
and  Watmter  [1968c,  6, c] , the  coefficients  of  linear  combinations  are  determined  by  minimizing  the  expected 
mean  square  error  in  the  solution  when  averaged  over  a representative  ensemble  of  profile  and  unconelated 
measurement  errors. 
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Several  examples  of  inverse  convolution  problems  can  be  found  in  geophysics  [seismology  or  magnetic  inter- 
pretation, as  in  Bott,  1967]  and  in  optics.  The  problems  reducing  to  an  Abel's  integral  equation,  discussed  earlier, 
are  of  this  form.  Other  examples  leading  to  the  numerical  inversion  of  integral  operators  are  generally  studied  by 
well-known  numerical  methods.  It  is  worth  mentioning  that  in  many  cases,  there  are  several  possible  reductions  of 
a given  physical  problem  (several  integral  equations  giving  a key  to  the  solution).  Some  of  these  possible  reductions 
may  be  better  than  others  for  numerical  computations.  It  may  also  happen,  in  certain  cases,  that  a new  way  of 
reducing  the  problem  is  physically  better,  because  it  automatically  bars  unphysical  solutions.  An  example  is  given 
in  ground  electrical  sounding,  where  the  formulation  used  by  Kunetz  and  Rocroy  [1970]  generates  only  sequences 
of  positive  resistivities. 


Mathematical  Foundations  of  the  Numerical  Methods  in  Inverse  Problems 

Rather  recently,  there  has  been  interesting  mathematical  work  on  the  general  problems  of  convergence,  optimi- 
zation, statistical  convergence,  and  filtering  in  well-posed  inverse  problems  (there  called  “identification”  and  “estima- 
tion” problems).  Several  strong  theorems  have  been  proved  by  topological  methods  [Bensoussan,  1969] . 


Critical  Comments  on  the  Numerical  Methods 

The  greatest  asset  of  these  methods  is  that  they  work  in  all  the  cases  for  which  the  direct  problem  can  be 
solved  numerically.  Their  greatest  defect  is  that  they  give  biased  information.  The  trial  functions  have  to  be  chosen 
in  subclasses  of  C defmed  essentially  for  the  purposes  of  the  numerical  calculations.  Certain  of  the  conditions  are 
simultaneously  physical  conditions.  There  is  no  lack  of  information  if  all  the  conditions  are  really  physical  condi- 
tions-that  is,  conditions  implied  by  the  physical  laws.  In  many  cases,  however,  the  so-called  “physical”  constraints 
follow  from  previous  experimental  tests.  In  other  words,  they  follow  from  the  known  accumulation  of  previous 
“inverse  problem”  solutions  in  a certain  subclass  of  C ■ Since  these  inverse  problem  solutions  often  are  obtained 
by  poorly  conditioned  trial  and  error  methods,  the  constraints  on  this  subclass  of  C should  not  be  assumed  in  sub- 
sequent analysis.  Rather,  the  investigation  of  the  trial  solutions  should  be  as  broad  as  possible,  and  the  physicist 
should  have  always  in  mind  that  nothing  proves  that  the  path  toward  the  complete  knowledge  of  a physical  law  is 
unique. 

On  the  other  hand,  if  the  problem  is  well  posed,  it  reduces  to  a problem  of  “optimal  control”  and  can  be 
treated  by  well-known  numerical  methods  [Lions,  1968] . 


FUNDAMENTAL  QUESTIONS 


To  summarize  the  above  analysis:  What  kind  of  knowledge  do  the  approximate  and  the  numerical  methods 
afford?  Given  the  experimental  results,  what  is  the  dynamical  function?  These  methods  yield  the  answer,  “Here 
is  a dynamical  function  that  for  the  given  kinematics  yields  results  approximately  equal  to  the  experimental  measure- 
ments.” 

What  kind  of  knowledge  would  we  like  to  obtain?  If  we  know,  from  previous  information  or  from  a general 
principle,  a plausible  dynamical  function,  then  we  would  like  to  know  whether  the  corresponding  results  do  fit  the 
actual  measurements.  If  the  only  previously  obtained  information  enables  one  to  reduce  the  number  of  the  arbi- 
trary parameters  characterizing  the  desired  dynamical  function,  then  it  is  interesting  to  use  trial-and-error  methods 
to  solve  the  inverse  problem.  However,  we  must  realize  that  the  “proofs”  obtained  by  these  methods  can  always  be 
questioned,  while  the  counterexamples  they  may  give  cannot.  Take,  for  example,  the  fitting  cf  cross  sections  by 
means  of  a complex  potential  of  the  Woods-Saxon  form.  It  hat  been  recognized  that  a smooth  variation  of  the 
potentials  with  respect  to  the  kinetic  energy  is  necessary  for  a good  fit.  We  have  every  reason  to  believe  that  such  a 
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dependence  is  “physically  correct,”  since  according  to  the  theory  the  constructed  potential  should  be  equivalent 
with  a nonlocal  one.  From  the  point  of  view  of  inverse  problems,  however,  the  fact  that  a static  Woods-Saxon 
potential  cannot  fit  the  experimental  results  at  all  energies  does  not  prove  that  no  static  potential  can  do  it,  since 
this  inverse  problem  is  improperly  posed  by  underdetermination! 

Therefore,  what  we  would  like  to  obtain  from  inverse  problem  studies  is  not  only  the  most  probable  dynami- 
cal function,  but  the  total  amount  of  information  contained  in  experimental  results.  This  means  that  we  must 
evaluate  our  results  in  terms  of  the  following  factors. 

1 . Existence.  Does  a dynamical  function  exist  in  C that  gives,  through  the  given  kinematics,  the  proposed 
experimental  result?  In  other  words,  if  £,■  is  an  “experimental  result,”  is  there  an  element  Q of  C that 
M maps  onto  £,?  When  Cj  does  exist,  it  is  called  a solution  of  the  inverse  problem  corresponding  to  the 
experimental  result  £,-. 

2.  Uniqueness.  If  £,•  is  given  and  Q exists  in  C , is  Cj  unique?  The  answer  is  positive  if  the  inverse 
problem  is  what  we  called  “well-posed.”  From  the  mathematical  point  of  view,  a perfect  experimental 
result  is  simply  an  element  Ej  of  E . A real  experimental  result  £,•  is  actually  the  set  of  all  the  elements 
of  E that  are  consistent  with  the  measurements  and  their  possible  errors.  Since  in  practical  cases  Ej  is 
given,  not  Ej,  there  cannot  be  uniqueness.  Therefore,  it  should  be  understood  that  the  question  of  unique- 
ness. as  well  as  3, 4,  and  5 below,  involve  £,-,  not  £,-.  Only  in  the  later  step  do  we  allow  £,  to  take  on 
successively  all  values  in  the  set  Ej. 

3.  Constructibility.  It  is  not  sufficient  to  know  that  a function  exists.  For  a physicist,  a much  more  impor- 
tant question  is:  Can  a solution  of  the  inverse  problem  be  constructed  for  any  element  Ej  of  E ? By 
“constructive  method,”  we  mean  one  in  which  it  is  possible  to  obt.'.a,  after  a convenient  finite  number  of 
steps,  any  desired  approximation  of  the  solution. 

4.  Complete  constructibility.  When  the  answer  to  2 is  negative  we  also  are  led  to  the  question:  Can  all  the 
solutions  of  the  inverse  problem  belonging  to  a given  class  C,  ( C,CC)  be  constructed?  In  the  following, 
we  call  equivalent  two  solutions  of  the  inverse  problems  corresponding  to  the  same  result  Ej.  Now,  suppese 
we  can  construct  a set  fi  of  equivalent  potentials.  It  is  important  to  compare  the  elements  of  f , , or  the 
elements  of  subsets  of  f t defined  by  some  a priori  “physical  assumptions.” 

5.  Approximation.  We  are  therefore  led  to  the  question:  Is  it  possible  to  define  a distance  in  C i , and  to  get 
definite  bounds  for  the  diameters  of  the  sets  f i O Ci  ? Clearly,  questions  3, 4,  and  5 are  essentially  con- 
cerned with  undetermined  problems.  Consider  now  a well-posed  problem  ( if  the  problem  is  underdeter- 
mined, it  is  theoretically  possible  to  restrict  f ( so  as  to  obtain  a well  posed  problem),  and  let  us  allow  Ej 
to  take  various  vrlues  in  Ej.  It  is  most  important  to  know  whether  f,  show  small  or  targe  perturbations 
when  a small  perturbation  is  imposed  on  £). 

6.  Stability.  We  now  ask:  With  E and  C , being  metric  spaces,  is  the  mapping  of  C onto  C t (the  solu- 
tion of  the  inverse  problem)  continuous?_A  related  question  concerns  the  efficiency  of  a series  of  tests. 

If  a series  of  random  tests  yields  results  Ej(n\  and  if  the  corresponding  dynamical  functions  lie  in  the  sets 
£i(n),  does  the  intersection  of  these  sets  steadily  go  “toward  a unique  element”? 

We  have  touched  on  these  questions  in  earlier  sections.  However,  the  studies  described  there  were  much  more 
concerned  with  obtaining  a solution,  whatever  it  may  be,  than  with  information  problems.  In  the  next  two  sections, 
the  quest  for  information  is  pursued  methodically.  The  difference  in  the  studies  described  does  not  reside  (in  most 
cases)  in  the  nature  of  the  methods  they  contain  but  in  the  quality  of  the  information  they  seek. 


FORMAL  METHODS  AND  INCOMPLETE  EXACT  SOLUTIONS 


Scarcely  Parametrized  Methods 

The  simplest  way  to  construct  an  exact  method  is  to  reduce  so  drastically  the  class  of  possible  dynamic  functions 
that  the  direct  problem  can  be  exactly  solved  by  analytical  methods.  The  inverse  problem  then  reduces  to  determining 
a very  few  parameters  by  elementary  operations  and  matrix  inversions.  On  the  other  hand,  the  necessary  consistency 
conditions  on  the  results  are  very  strong.  They  can  be  fulfilled  only  if  the  accutacy  of  experimental  results  is  very 
poor.  In  practice,  this  method  is  used  mainly  to  give  models  that  show  the  importance  of  a particular  type  of  experi- 
mental result.  More  or  less  academic  examples  of  this  kind  can  be  found  in  many  areas,  such  as  the  one-  or  two-layer 
models  of  seismology  [Ewing  et  al.,  1958] . 


Special  Infinite  Gasses  of  Dynamical  Functions 

Physical  considerations  are  sometimes  sufficient  to  select  in  C an  infinite  class  of  dynamical  functions  that, 
for  some  reason  are  physically  more  meaningful  than  the  others.  In  quantum  scattering,  an  example  of  this  kind  is 
shown  in  the  class  of  superpositions  of  Yukawa  potentials,  which  are  favored  by  certain  theoretical  arguments: 

V(r)  * r~l  f C(r.)  exp[-ar]  da  (31) 

Another  class  corresponds  principally  to  “simplicity”  motivations,  the  class  of  truncated  potentials-those  that  vanish 
beyond  a finite  radius.  The  same  kind  of  motivations  favors  finite  smooth  bodies  in  electromagnetic  scattering.  In 
these  scattering  problems,  the  asymptotic  properties  of  the  waves  can  be  represented  with  the  help  of  a transition 
matrix,  part  of  which  is  measurable  in  scattering  experiments.  From  the  properties  of  the  special  classes  investigated, 
one  can  derive  analytic  properties  of  the  transition  matrix  T,  which  in  turn  permit  derivation  of  the  matrix  from  its 
measurable  portion  by  a process  of  analytic  continuation. 

In  the  class  of  Yukawa  potentials  for  the  quantum  inverse  problem  at  fixed  C [Martin,  1961 ; Chadan,  1962] 
and  at  fixed  E [Martin  and  Targomki,  1961] , the  construction  of  the  dynamical  function  from  the  transition  func- 
tion involves  the  use  of  a dispersion  formula  and  a special  iteration  procedure.  In  the  class  of  truncated  potentials 
[Marchenko,  1957,  Laiffel,  1968] , his  derivation  is  more  formal.  In  electromagnetic  scattering  [Weston  and  Boerner, 
1967] , the  Held  is  readily  calculated  outside  the  convex  envelope  of  the  body.  It  remains  to  locate  the  surface  of  the 
(convex)  body  by  taking  into  account  necessary  conditions  on  the  boundary. 

What  is  the  value  of  these  methods  for  the  inverse  problem?  The  inverse  procedures  are  broadly  formal.  Insta- 
bility for  random  perturbations  of  the  data  appears  in  the  analytic  continuations.  In  addition,  the  necessary  consis- 
tency conditions  on  the  experimental  result  are  not  really  known  (but  they  are  probably  not  very  strong).  There- 
fore, these  methods  have  hardly  much  practical  value.  On  the  other  hand,  they  are  important  from  the  theoretical 
point  of  view,  because  their  solution  is  generally  unique  and  physically  meaningful. 

Although  it  is  difficult  to  say  that  physical  considerations  have  suggested  the  choice  of  separable  potentials,  we 
put  here  the  study  of  their  inverse  problem  because  it  has  common  characteristics  with  the  above  ones.  Again,  inte- 
gral equations  containing  Hilbert  transforms  that  come  from  dispersion  relations  are  a key  to  the  problem  (the  so- 
called  Mushkelishvili  equations).  The  inverse  problem  treated  in  this  way  is  probably  a go 'd  approach  to  the  general 
inverse  problem  of  nonlocal  potentials.  It  has  first  been  studied  for  theoretical  reasons  [l  urdin  and  Martin,  1957, 
1958;  Chadan,  1958, 1967]  but  has  more  recently  gained  the  favor  of  nuclear  physicists  [Bolsterli  and  Mackenzie,  1965; 
Mills  and  Reading,  1969;  Tabakin,  1969] , who  could  have  saved  much  time  by  reading  the  previous  references.  One 
of  the  most  remarkable  successes  of  this  problem  is  the  result  of  Chadan  [1967]  concerning  the  possibility  of 
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cancelling  by  a separable  potential  the  phase  shifts  generated  by  a local  potential,  obtaining  in  this  way  a potential 
that  is  transparent  at  all  energies  and  therefore  cannot  be  distinguished  from  the  potential  0 by  scattering  experi- 
ments! No  example,  we  think,  better  demonstrates  the  surprising  results  that  the  physicist  can  reach  in  a study  of 
question  2. 


Formal  Methods 

The  inverse  scattering  problem  at  fixed  £ in  quantum  mechanics  has  given  rise  to  a large  number  of  studies 
from  which  some  methods  have  emerged  that  have  a large  domain  of  applicability.  The  Jost  and  Kohn  [ 1 962] 
method,  for  instance,  in  which  the  transition  matrix  is  expanded  in  powers  of  the  interaction,  has  been  used  as  well 
in  electromagnetic  scattering  (/hosser,  1971] . The  method  of  Hylleraas  [1948]  although  it  yields  only  one  so  u- 
tion  of  an  nnderdetermined  problem,  has  recently  gained  some  favor  among  chemical  physicists.  However,  no 
method  has  been  so  generally  applicable  or  given  rise,  by  analogy,  to  so  many  other  methods,  as  the  Gelfand-Le  itan 
method. 

There  are  two  aspects  to  the  Gel’fand-Levitan  method.  The  first  is  its  relation  with  the  spectral  problem  for 
differential  operators;  the  second  is  the  “algebraic"  structure  of  the  method  itself.  The  first  one  expresses  the 
mathematical  contents  of  the  method,  and  the  second  expresses  the  constructive  nature  of  the  method  for  the  study  of 
inverse  problems.  We  briefly  consider  the  first  aspect  in  the  next  two  sections,  referring  the  reader  to  the  original 
paper  of  Gelfand  and  Levitan  (1951  ] and  the  excellent  review  paper  of  Faddeev  11963] . As  for  the  second  aspect, 
it  is  interesting  to  draw  the  scheme  of  the  method  because  it  works  for  so  many  generalizations  and  analogous 
methods. 

We  are  interested  in  the  construction  of  the  potential  K(r)  from  the  phase  shift  of  the  regular  solution  of 

[D(r)  + k *]  *K(*,r)  = 0 (32) 


where 


D(r)  = D0(r)  - V(r) 


(33) 


and  the  normalization  of  $y(k,r)  is  chosen  in  such  a way  that  <py(k,r)  ~ r as  r goes  to  zero.  The  asymptotic 
behavior  of  0y(*,r)  is  denoted  by 


<t>y(k,r) 


■/(*)  1 

k 


sin  [kr  + 5(k)J  + o(l) 


(34) 


The  function  6 (k)  is  the  “experimental  result."  Now,  the  key  of  the  Gel’fand-Levitan  equation  is  a function 
K0  ^{r,  r),  referred  to  variously  as  the  “transformation  kernel”  or  the  “generalized  translation  operator,”  which 
yields  readily  both  the  potential 


V(r)  * -2-— *oK(r.r) 
dr 


(35) 
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DIRECT  PROBLEM 

Using  an  arbitrary  solution  of  Eqs.  (39)  in  place  of  tL  auxil  ary  function”  and  constructing  the  transformation 
kernel  yields  examples  of  solutions  of  the  Sdirodinger  Eq.  (32),  which  can  be  exactly  calculated  if  (r,r)  is 
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conveniently  cho*en.  As  an  example,  if  fairs')  is  the  product  a sinh  (k/r)  sinh(fc,V),  we  obtain  a so- 
called  “Bargmann  potential’'  [Bargmann,  1949]. 

The  second  way  of  using  the  above  scheme  is  to  relate  the  experimental  results  to  the  “auxiliary  function,'' 
obtaining  a solution  of  the  inverse  problem.  In  the  inverse  quantum  scattering  problem  at  8 ■ 0,  where  the  above 
procedure  exactly  applies,  this  is  done  by  relating  f£(r,r')  to  l/(k)l  by  a spectral  expansion  and  relating  6 (k)  to 
l/(*)l  by  a dispersion  relation  (discussed  in  a later  section). 

The  logic  of  the  machinery  described  above  applies  to  a large  number  of  inverse  methods.  Moreover,  the 
analogies  are  generally  not  restricted  to  the  general  scheme  but  also  imply  analogous  partial  differential  equations 
and  integral  equations.  Let  us  sketch  rapidly,  by  order  of  increasing  complication,  these  extensions  of  the  Gel’fand- 
Levitan  formalism. 

Replacing  D0(r)  by  D0(r)  - 8(8  + Or-1  readily  yields  the  formalism  for  any  8.  PerVwg  O by  Va. 
the  free- wave  functions  by  the  V0  - perturbed  wave  functions,  and  V by  V-  VQ  readih  we  perturbed 

formalism.  Replacing  the  wave  functions  by  the  Jost  solutions  and  ail  the  boundary  con .<  u the  origin  by 

asymptotic  conditions  at  + «*  we  obtain  the  so-called  “Marchenko  formalism”  [Marchenko  Agranovich  and 
Marchenko,  1963;  and  the  quoted  references] . Both  the  Gelfand-Levitan  and  the  Marchenko  .wnualism  can  be 
extended  by  using  vectors  in  place  of  the  wave  functions,  yielding  generalizations  of  the  procedure  to  coupled  equa- 
tions [Newton  and  Jost,  1955;  Anew,  1956],  to  the  scattering  of  spin  1/2  particles  by  a tensor  force  [Newton.  1955; 
Agranovich  and  Marchenko,  1958] , and  to  coupled  channels  [Cox.  1962] . 

To  study  the  inverse  scattering  problem  at  fixed  E,  in  which  the  “experimental  result"  is  the  set  of  the  phase 
shifts  6g  at  the  given  energy,  it  is  again  possible  to  use  the  formalism  quoted  above,  with  the  following  replacements. 


D0(f)  - r 


(5 


■) 


(41) 


0(0  - D0[r)  -r3  MO 
k2  •+  -8(8  + 1) 

(t  \ut 

Yr)  y8+l/2(r> 


(4?) 


Moreover,  the  Gelfand-Levitan  integral  Eq.  (40)  and  the  generating  formula  in  Eq.  (36)  are  replaced  by  the  Regge- 
N<*w»on  integral  equation 


K(r,r)  - f{r,  r')  - f ' *(;.  p)f(fi,r)o^  dp 


(43) 


and  the  generating  formula 


iKM  * ^o(8,0  ~ yrA(r,p)il»o(8,p)p”*  dp 


(44) 
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This  formalism  was  introduced  by  Regge  ( 19S9] . To  construct  solutions  of  tU  inverse  problem,  Newton 
[1962]  put  a restriction  on  the  class  of  possible  dynamics  by  choosing  a function  f(r,  r')  of  the  form 


wo 

nr.  r)  - ]Pc£  *0(8,r)*0(8,r') 
8*0 


(45) 


where  (8+  1/2)  c£  is  absolutely  and  uniformly  bounded.  The  cg  can  be  obtained  from  conveni  ntly  bounded 
sets  of  phase  shifts  by  matrix  inversions  [Newton,  1962 , Sabatier,  19663] . The  potential  thus  obtained  >s  not  unique, 
unless  sufficient  decrease  of  the  potential  is  imposed  f.*> batter , 19663] . Geneislizationt  have  been  given,  increasing 
the  class  of  possible  dynamics  [Sabatier,  1966c,  1967 «;  Coudray  and  Cot,  1970 1 . 

Study  of  the  inverse  problem  at  fixed  energy  for  spin-orbit  potentials  and  for  tensor  forces  requ*es  a deep 
transformation  of  the  "artial  differential  equations  and  the  integral  equations,  introducing  a nonlinear  coupling.  A 
method  for  discovering  procedures  of  this  kind  has  been  propose  1 by  Sabatier  [1968] , wl:>>  studied  the  spin-orbit 
case.  Hooshyear  [1970]  gave  a procedure  to  deal  with  both  the  spin  orbit  and  the  tensor  potential,  lr.  both  cases,  the 
general  logic  of  the  method  described  is  present. 

Thus,  generalizations  of  the  Gelfand-Levitan  machinery  cover  a number  of  inverse  problems  in  quantum  scatter- 
ing theory.  Still  more  important,  many  other  inverse  scattering  problems  can  be  reduced,  often  by  elementary  trans- 
formations, to  mathematical  problems  formally  identical  to  the  inverse  problems  of  quantum  mechat..cs.  Examples 
can  be  found  in  the  inverse  problem  for  strings  [Krein,  1953;  Kac,  .965] , in  the  inverse  pio'olem  for  transmission 
lines  [Kay,  1971,  and  quoted  references] , in  the  inverse  seismic  problem  [Blagovtshchemkii,  1967;  AW  and  Ware, 

1970] , and  in  the  inverse  problem  for  plasmas  [Degasperis,  1970] . 

Now,  what  kind  of  answer  can  be  given  to  our  fundamental  questions  by  the  procedures  studied  above.  In 
certain  cases,  it  is  possible  in  this  way  to  get  a complete  solution  of  an  inverse  problem.  However,  the  completeness 
of  the  solution  is  not  related  to  the  structure  of  the  method,  but  to  the  completeness  of  certain  spectral  * sponsions , 
so  that  its  study  is  more  conveniently  deferred  to  the  next  section. 

On  the  other  hand,  it  follows  readily  from  the  very  structure  of  the  method  that  any  “auxiliary  function”  that 
is  a solution  of  a certain  partial  differential  equation  can  be  used  to  construct  solutions  of  the  direct  problem.  There- 
fore, tc  solve  the  inverse  problem,  it  is  sufficient  to  select  a class  of  auxiliary  functions  having  a one-to-one  correspond- 
ence with  the  experimental  results.  Since  this  class  is  selected  for  technical  reasons,  it  sol/es  question  3;  therefore, 
provided  some  attention  is  paid  to  convergence  problems,  it  may  solve  question  1 and  give  a partial  answer  to  2. 
Furthermore,  it  may  be  relatively  simple  to  study  the  continuity  of  the  inversion  procedure  and  to  answer  question 
6.  But,  since  the  special  class  of  dynamics  in  which  a solution  is  sought  is  not  chosen  for  physical  but  for  technical 
reasons,  there  is  no  hope  to  answer  questions  4 and  5. 

The  same  conclusion  is  valid  for  all  Jie  methods  available  in  the  study  of  inverse  problems  in  the  relativistic 
case  [Corinaldesi,  1954;  Verde,  1958, 1959;  hats  and  Tod,  1959  .Gasymov,  1968;  Degasperis,  191Q-,  Corbella,  1970]. 

In  concluding  our  discussion  of  the  Gelfand-Levitan  machinery  and  similar  methods,  we  note  that  the  theory  of 
generalized  translation  operators  [Levitan,  1964]  is,  in  principle,  the  right  tool  for  constructing  new  methods  of  this 
type  [see,  for  example,  Coudray  and  Cos,  1970] , but  in  many  cases  more  particular  methods  are  more  efficient  [see, 
for  example,  Sabatier,  1968] . It  also  is  worth  noting  that  incomplete  Gelfand-Levitan  methods  have  sometimes 
been  used  for  practical  applications  [see,  for  example,  Kim  and  Vasavada,  1966,  for  elementary  particles;  and  Portinari, 
1967,  for  optics] . 
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Problems  Exactly  Reducible  to  an  Integral  Equation 

Some  inverse  problems  can  be  exactly  reduced  to  an  integral  equation.  The  fundamental  questions  1 through 
6 are  then  studied  with  regard  to  the  integral  equation.  If  this  equation  is  linear,  a great  wealth  of  results  ir  available 
in  the  literature,  and  all  the  question.'  can  easily  be  studied.  This  does  not  mean  that  the  studies  are  always  pleasant. 
As  we  have  seen,  the  Fredholm  equation  of  the  first  kind,  which  is  often  encountered,  necessitates  several  assump- 
tions to  answer  questior  1 and  makes  the  study  of  question  5 difficult.  However,  a much  more  unpleasant  case  is 
the  nonlinear  equation  encountered  in  the  construction  of  the  phase  shifts  from  the  cross  sections  [Newton,  1968; 
Martin,  1969] ; in  this  problem,  whose  solutions  were  reviewed  by  Newton  in  Chapter  S,  the  only  way  of  proving 
the  existence  of  solutions  is  to  use  fixed-point  theorems.  The  Banach  theorem,  which  has  the  remarkable  character- 
istic of  giving  a constructive  method  of  solution,  works  when  the  cross  section  satisfies  certain  bounds,  yielding  one, 
and  only  one,  solution.  It  is  possible  to  increase  slightly  the  domain  of  existence  and  uniqueness,  and  the  domain  in 
which  the  existence  of  the  solution  is  guaranteed,  by  using  the  Leray-Schauder  theorem.  Unfortunately,  even  the 
largest  domain  i'  not  very  useful  for  applications. 

Strong  theorems  of  analysis  have  been  used  in  certain  inverse  problems  to  give  answers  to  questions  1 or  2 but 
are  generally  not  very  efficient  in  giving  ^instructive  methods  [see,  for  example,  Mireles,  1965;  and  Hunter  and 
Bates.  1970]. 


Research  of  Condensed  Properties 

Usually,  in  inverse  problems,  the  construction  of  the  dynamical  functions  from  the  experimental  results  is  quite 
complicated.  Therefore,  it  is  interesting  tc  look  for  properties  of  the  dynamical  functions  that  can  be  readily  derived 
from  the  experimental  results.  An  example  of  these  properties  is  given  by  the  following  exact  relationship  between 
the  value  of  the  potential  at  the  origin,  the  scattering  phase  shift,  known  as  a function  of  the  momentum  k , and  the 
(negative)  bound-state  energies; 


[*Tj(*)}*d*  - 4]TVn  (46) 

n 


This  formula  was  obtained  by  Newton  [1956]  through  variational  methods.  Extensions  of  the  result  to  the  deriva- 
tives of  the  potential  at  the  origin  have  been  given  by  Faddeev  [1957 ) , Busiaev  and  Faddeev  [1960]  .Percival  [1962], 
Roberts  ['  i], Busier*  [1967] .Cqhgero and Degasperis  [1968] , ana  in  the  relativistic  case, Degasperis  [1970]. 

These  results  were  reviewed  b>  Calogero  in  Chapter  5. 


INFORMATIVE  SOLUTIONS 


As  noted,  the  central  problem  in  the  previous  section  was  the  construction  of  the  dynamics.  Here  the  central 
problem  is  the  determination  of  the  amount  of  information  contained  in  the  experimental  results,  which  can  be 
pursued  by  three  types  of  investigation.  The  first  approach,  the  informative  formulation  of  the  problem,  which 
applies  to  problems  which  are  too  intricate  to  be  solved,  consists  of  - direct  appraisal  of  the  spreading  of  the  inverses 
of  the  functional  M . The  second  consists  of  completely  solving  the  problem  in  a priori  well-defined  mathematical 
classes,  yielding  what  we  call  “a  complete  solution”  of  the  inverse  problem.  It  is  then  possible  to  perform  the  third 
way  of  investigation-  that  is,  appraisal  of  the  dia,.  .ers  of  the  subsets  of  equivalent  potentials  in  C- 
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Informative  Formulations 


The  best  example  of  these  formulations  is  the  Backus  and  Gilbert  [1968]  method  for  the  inverse  problems  in 
geophysics.  It  is  assumed  that  the  class  C of  all  the  possible  dynamics  (here,  the  earth  models),  is  a linear-normed 
space  (here,  the  continuous  functions  of  r , with  values  in  R„),  and  one  defines  each  way  of  measurement  by  a 
mapping  of  C intof;  Mi  is  called  a “gross  earth  functional.”  A finite  collection  of  mappings  M/  defines  a 
set  of  acceptable  dynamics.  The  authors  assume  that  the  functionals  M/  are  differentiable  and  replace  them, 
locally,  by  their  differential.  This  leads  to  the  study  of  linear  functionals  Mi , which  in  a simple  case  have  the  form 


gj(m)  = fGi(r)m(r)dr 


where  G,(r)  is  a known  function  of  r.  If  the  measured  value  of  g,(m)  is  7{-,  the  dynamical  function  m(r)  is 
defined  by  the  generalized  moment  problem 


7,  = fGj(r)m(r)dr  (48) 

Clearly,  the  problem  has  an  exact  solution  if  it  is  possible  to  construct  a kernel  A (r^,  r)  that  is  a linear  combination 
of  the  Gj(r)  and  such  that,  for  any  function  m(r)  in  the  class  of  dynamics  which  are  studied,  and  for  any  r0 , 


m(r0)  = fA(r0,r)m(r)dr 


If  such  a function  is  known,  and  is  equal  to 


the  system  of  Eq.  (48)  has  the  solution 


Mr0S)  = TfitoGiV) 


m(To)  =/?i(ro)  7j 


The  solutions  to  the  inverse  problems,  except  in  very  narrow  classes  of  dynamical  functions,  are  not  unique,  and  it  is 
not  possible  to  construct  a kernel  A (r0,r)  satisfying  Eq.  (49).  But  is  is  sometimes  possible  to  construct  approximate 
kernels  A ( rQ , r),  namely,  unimodular  linear  combinations  of  the  G,-(r)  that  are  in  some  sense  nearly  5(r-  ra). 

More  precisely,  one  selects,  throughout  the  unimodular  linear  combinations  of  the  G, •(/■),  the  one  that  is  most  nearly 
o(r~  r0)  (or,  in  Backus  and  Gilbert  terminology,  has  the  “best  8-ness”).  Thus,  such  a method  gives  an  “average” 
solution  of  the  inverse  problem;  more  important,  it  gives  an  appraisal  of  the  deviation  from  each  other  of  the  equiva- 
lent solutions.  This  appraisal  is  readily  related  to  the  8-ness  of  the  kernel  that  can  be  effectively  constructed  from  the 
knowledge  of  the  functionals  G,-(r)  and  the  experimental  data.  The  deviation  can  be  visualized  in  another  way:  one 
of  its  consequences  is  the  existence  of  a dynamical  function  m(r),  which  averages  to  ouer  all  but  small  length 
scales,  and  to  the  first  order  represents  a possible  deviation  of  any  computed  dynamical  function.  If  this  is  the  only 
source  of  the  differences  among  the  equivalent  dynamics,  there  will  be  some  smallest  length  such  that  all  the  accept- 
able dynamics  have  approximately  the  same  average  over  an  interval  of  length  £ at  radius  r ; £ is  called  the  resolv- 
ing length  at  radius  r.  Backus  and  Gilbert  [1970]  were  also  able  to  determine  how  the  errors  in  the  data  affect  the 
resohnrg  length  of  the  data. 
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Now,  the  great  value  of  this  method  clearly  lies  in  its  physically  meaningful  presentation  of  the  consequences 
of  nonuniqueness.  In  the  frame-work,  of  inverse  problems  in  which  the  functionals  are  reasonably  represented  by 
their  differential,  it  yields  a fairly  good  answer  to  questions  4 and  5. 

Earlier  we  showed  how  an  “inverse”  study  completed  on  the  differential  of  a mapping  could  be  extended  to  the 
mapping  itself,  by  using  the  Banach  method  of  successive  approximations.  This  extension  is  valid  in  a certain  open 
ball  of  the  normed  space,  with  a radius  related  to  the  norm  of  the  selected  inverse  mapping.  Clearly,  if  we  could  find 
a covering  of  the  v/hcle  normed  space  C by  these  open  halls,  the  studies  on  the  differentials  would  be  fully  extended. 
However,  such  a covering  cannot  be  found.  Wheie  the  distortion  is  too  greet,  equivalent  potentials  escape  this  schune, 
and  only  complete  solutions  (or,  in  lucky  cases,  random  investigations)  are  able  to  detect  them. 


Complete  Solutions 


Let  C be  the  set  of  all  possible  dynamical  functions.  A complete  solution  of  the  inverse  problem  would  be  a 
method  of  solution  giving  all  the  equivalent  solutions  in  C . This  definition  is  too  strong,  because  it  includes  very 
unphysical  complications,  such  as  solutions  that  are  almost  everywhere  equal  and  nevertheless  should  be  considered 
as  distinct  solutions  as  defined  above.  Therefore,  we  use  a slightly  weaker  definition which  C is  replaced  by  “a 
mathematically  well-defined  subclass  of  C , which  is  dense  in  C for  a reasonable  norm  and  whose  definition  inside 
C is  done  a priori  by  simple  mathematical  requirements”  (not  a posteiiori,  to  make  the  problem  solvable,  as  in 
incomplete  solutions).  Let  us  di  .card  the  problems  that  reduce  to  a single  equation  and  can  be  studied  by  well-known 
methods.  The  following  complete  methods  are  available  in  more  complicated  inverse  problems. 

In  the  inverse  scattering  problem  at  fixed  £,  where  the  Gelfand-Levitan  formalism  applies,  it  is  possible  to 
show  that  fir,  r)  has  the  expansion 

/°°  m 

d LV  0o(k/V>o(k/>'<J  jl/(*)l_J  - 'j  + ^ C;  sinh  (kjr)  sinh (*/)  (52) 


where  the  kj  correspond  to  the  bound  states  (k2j  = -Ej).  This  expansion  holds  and  is  unique  for  any  given 
potential  of  a large  class-say  the  class  U -of  continuous  functions  K(x)  such  that 


jT°°x  i y(x)  i dx 


is  finite  for  any  e > 0.  If  the  phase  shift,  as  a function  of  k and  the  bound-states  parameter,  is  consistently  given, 
l/(k)l  and  fir,  r')  can  be  constructed  from  which  K (r,  r')  and  the  potential  can  be  derived.  Therefore,  the  non- 
uniqueness  in  the  inverse  problem  [where  5 (k)  only  is  known]  comes  from  the  bound-state  arbitrariness. 

In  the  Marchenko  formalism,  the  relation  between  the  auxiliary  function  fir.  r')  and  the  experimental  results 
is  still  simpler  than  in  the  Gelfand-Levitan  formalism;  fir,  r)  is  equal  to  a function  of  (r  + r’),  say  Fir  + r'),  which 
has  the  expansion 


Fix) 


— [S(k)  - 1]  eikx  dk  + £ Sie  ~kjx 

2n  •'-<» 


(53) 
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where  S(k)  is  the  (experimentally  measurable)  scattering  function  (S(k)  = exp  [2/6  {k)]),  and  the  1 and  Sj 
refer  to  bound  slates.  Again,  this  expansion  exists  and  is  unique  for  any  potential  in  U . Clearly,  the  Marchenko 
method  is  simpler  to  apply  to  the  inverse  problem  at  fixed  8 than  Gell'and-Levitan’s.  This  improvement  i:.  cer- 
tainly related  to  the  fact  f t the  Marchenko  approach  reproduces  more  faithfully  the  physical  path  of  the  infor- 
mation. A further  consequence  is  that  «he  Marchenko  method  has  applications  in  3 larger  area  of  scattering 
problems,  such  as  the  transmission-line  inveise  problems. 

Each  unique  expansion  of  a function  f(r,  r)  has  its  origin  in  the  spectral  analysis  of  the  boundary-value  problem 
consisting  of  the  (differential)  wave  equation  and  the  boundary  conditions  required  for  its  solution.  If  the  dynamical 
functions  belong  to  a suitably  defined  class,  this  boundary-value  problem  has,  in  the  energy  plane,  a continuous  spec- 
trum coinciding  with  the  entire  positive  real  axis  ( E > 0),  and,  possibly,  a finite  number  of  non-positive  eigenvalues. 
The  corresponding  eigenfunctions  are  a total  subset  in  the  space  of  functions  f(r,  r\  corresponding  to  the  “well- 
behaved”  class  of  potentials. 

The  spectral  analysis  has  been  applied  by  Loefftl  [1968]  to  the  inverse  quantum  scattering  problem  at  fixed 
energy.  He  was  able  to  prove  that  f(r,  r)  has  a unique  spectral  expansion,  and  that  this  expansion  can  be  related  to 
properties  of  the  dynamical  interpolation  of  the  lost  function.  Unfortunately,  the  Jost  function  is  not  related  to  the 
phase  shifts  in  as  direct  a way  as  in  the  inverse  problem  at  fixed  8 , and  no  constructive  method  can  be  obtained  in 
this  way.  However,  this  has  given  partial  answers  to  questions  1 and  2. 

The  complete  method  given  by  Sabatier  [1972]  to  the  inverse  scattering  problem  at  fixed  energy  follows  from 
a Fourier  transform  property  of  /(r.  #■*).  It  can  be  shown  [Sabatier,  1970] , for  a subclass  E of  u that  is  not  much 
narrower  than  U , that  f(r,  r)  can  be  written  as 


M OO 

(rr’)  1 /(r,  r)  = 4 tJ/n  J w-1  sin  w u F(u)  du  (54) 

o 

where 


h>  = [(r  - r')*  + 4 rr'u2] 117 

(55) 

(f^u)  for  u < 1 

F(u)  = 

(56) 

1 ^i(“)  for  u < 1 

where  uF,(u)  is  a continuous  function  and  F3{u)  belongs  tc  I2(l,®>).  ”h*  important  point  is  that  F3(u), 

can  be  chosen  (almost)  arbitral  Jy.  Then  the  part  of  f{r,  r ),  corresponding  to  F,fu),  say  f^r,  r'),  can  be  expanded  in  a 

double  Vessel  series 


flir.r')  = ^ CE  "gW1 “g(r')  (57) 

8=0 


where  the  <v  are  related  to  F3(u)  and,  most  important,  can  be  obtained  by  several  niatrix  inversions  from  the  5g 
once  F3(u)  has  been  chosen.  In  that  work,  the  author  did  not  use  the  Regge-Newton  formalism  but  an  asymptotic 
form  of  this  formalism,  obtained  by  keeping  only  the  asymptotic  part  of  the  operators  [Sabatier,  1970] . Such  a 
formalism  has  the  advantage  of  avoiding  as  much  as  possible  the  dispersion  of  information  in  complicated  functions 
of  several  variables.  It  enabled  the  author  to  put  the  problem  in  the  form  of  a generalized  moment  problem  for  a 
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function  bijectively  related  to  the  dynamical  function,  the  so  called  “scattering  structure”  function.  Somewhat 
analogous  (but  less  useful)  results  can  be  obtained  for  the  inverse  problem  at  fixed  £ using  the  Krein’s  method, 
and  can  probably  be  obtained  in  most  scattering  problems. 

Approximation  theory - Any  complete  method  in  which  algorithms  yield  the  equivalent  solutions  can 
theoretically  be  used  to  study  the  “approximation”  question  4.  However,  from  a practical  point  of  view,  an 
approximation  theory  is  most  easily  constructed  if  it  is  possible  to  associate  to  the  dynamical  function  a function 
of  a well-known  space,  such  as  L2(0,°°),  in  which  approximation  theories  are  well  known.  This  is  easy  in  the  com- 
plete method  given  by  Sabatier  [1971] , because  of  the  properties  of  F2(u).  It  is  possible  to  derive  the  diameter  of  the 
set  of  equivalent  potentials  characterized  by  given  bounds  op.  the  derivatives,  and  to  predict  that  for  a static  potential 
this  diameter  goes  to  zero  as  C -*•  °°  and  is  generally  smaller  if  the  bounds  on  the  derivatives  are  smaller.  These  rest  ■I'" 
have  been  checked  by  numerical  computations  [Sabanei  u.<  Qityen  Van  Phil,  1971).  In  the  inverse  problem  at 
fixed  £,  Krein’s  method  has  been  used  by  Melnikov  [ 196 1 J to  study  approximate  methods. 

Advantages  of  the  complete  methods - Clearly,  the  complete  methods,  when  thev  are  constructive,  give 
answers  to  questions  1 through  5.  The  continuity  of  algorithms  for  the  construction  ot  the  equivalent  potentials  can 
be  checked  to  give  answers  to  question  6.  Is  it  possible  to  conclude  that  the  complete  methods  are  perfect?  Unfor- 
tunately, they  have  two  defects;  First,  they  are  very  difficult  to  obtain,  except  in  relatively  simple  cases;  and  second, 
they  are  complicated.  However,  these  two  properties  may  be  the  most  constant  characteristics  of  inverse  problems. 


MATHEMATICAL  GENERALIZATIONS 


Tlie  inverse  methods  involve  many  ma«hematic*'  tools,  most  of  which  are  beyond  the  average  level  of  mathe- 
matical tools  in  physics.  Therefore  it  is  not  surprising  that  an  important  body  of  mathematical  literature  on  inverse 
problems  has  developed.  Moreover,  physical  inverse  problems  have  suggested  the  study  of  many  math;  natical  in- 
verse problems.  After  all,  the  problems  of  construction  of  a function  of  certain  classes  such  as  harmonic  functions, 
or  holomorphic  functions,  from  its  values  on  a boundary  are  in  a certain  sense  inverse  problems.  If  we  limit  our  point 
to  inverse  problems  readily  originating  from  physics,  it  is  worth  mentioning  the  studies  made  to  clarify  or  formalize 
numerical  methods  or  improperly  posed  problems  (see,  for  example,  Lavrentiev,  1967] . However,  by  far  the  most 
numerous  mathematical  studies  connected  with  inverse  problems  have  been  those  of  inverse  spectral  problems  (con- 
struction of  an  operator  from  a spectral  function)  and  the  theory  of  generalized  translation  operators.  These  subjects 
have  been  extensively  studied  by  Russian  mathematicians,  in  particular,  Gelfand,  Levitan,  Marchenko,  Krein, 
Agranovich,  Fadoeev,  and  Berezanskii,  whose  works  have  been  cited  many  times  (see  also  references  to  Naimark, 
Ljance,  Gorbacuk,  and  Kac).  The  research  of  general  properties  of  Sturm-Liouville  problems,  has  been  of  interest  in 
mathematics  since  the  193C’s.  Many  general  results  [Titchmanh,  1940]  are  of  interest  in  the  physical  inverse  prob- 
lems. More  particular  results  have  been  devised  for  the  inverse  problem  in  quantum  mechanics  (see,  for  example, 
Borg,  1947;  Levinson,  1949«,6] . Other  results  could  easily  be  used  for  physically  interesting  generalizations  [Crum, 
1955,  Friedman,  1957]. 

All  these  mathematical  studies  have  the  outstanding  characteristic  of  being  rigorous:  The  class  in  which  the 
solution  is  sought  is  correctly  defined,  integrals  and  series  that  are  used  converge,  and  except  for  some  casual  and 
trivial  errors  without  important  implications,  the  results  can  be  taken  for  granted.  Unfortunately,  this  is  not  always 
true  in  physical  papers.  Therefore,  it  is  very  important  to  enhance  and  increase  the  curiosity  of  mathematicians 
relative  to  the  physical  inverse  problems.  On  the  other  hand,  it  also  is  true  that  matnematicians  tend  to  study  the 
simplest  inverse  problems  with  the  weakest  assumptions,  instead  of  giving  solutions  of  more  difficult  problems  with 
stronger  assumption;-.  Nevertheless,  ?asy  to  see  that  among  inverse  problems,  the  ones  which  have  awakened 
the  interest  of  mathematicians  are  now  the  best  known.  Are  they  well  known  because  they  have  been  studied  by 
mathematic  ians,  or  are  they  studied  by  mathematicians  because  they  are  easy  to  understand? 
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CONCLUSION 


The  direction  of  the  evolution  of  inverse  problems  is  determined  by  a qualitative  evolution  of  the  information 
sought.  Physicists  first  try  to  obtain  a solution  of  the  problem,  whatever  it  may  be,  either  exact  or  approximate; 
for  this  purpose,  approximate  methods  and  numerical  methods  are  the  best  tool.  They  soon  discover  that  the  domi- 
nate feature  of  the  inverse  problem  is  the  lack  of  uniqueness  of  solutions,  and  it  becomes  necessary  to  determine 
whether  this  feature  is  due  merely  to  experimental  uncertainties  (well-posed  problem)  or  to  the  very  structure  of  the 
problem  (improperly  posed  problem).  In  any  case,  the  problem  moves  toward  the  information  questions.  Roughly 
speaking,  the  physicist  goes  from 

“Her:  are  the  experimental  results.  Give  a dynamical  function  to  fit  them.”  to  ‘Here  are  the  experimental 
results.  What  amount  of  information  do  they  contain?” 

Apart  from  this  general  trend,  the  inverse  problems  exhibit  rr  features  in  common,  the  origin  of  which  can 
easily  be  ascertained.  The  Abel’s  integral  equation  is  clearly  a subproduct  of  a turning  point  in  a classical  trajectory. 
The  perturbation  method,  which  is  a very  general  tool  in  functional  analysis,  produces,  to  the  first  order,  liner r 
functionals.  The  integral  form  of  such  a function  being  of  a large  generality  in  analysis,  Fredholm  integral  equations 
come  in  a natural  way.  And  the  Gelfand-Levitan  and  derived  methods  can  be  related  to  properties  of  self-adjoint 
operators  in  scattering  problems. 

For  an  interdisciplinary  working  program  on  inverse  problems,  we  think  that  these  three  tools— classical  methodr, 
inversion  of  linear  functionals,  and  spectral  methods- should  be  three  chapters.  A fourth  one  would  deal  with  the 
numerical  methods  of  identification,  with  optimizing  programs.  A fifth  one,  perhaps,  would  deal  with  statistical 
numerical  methods.  In  all  these  studies,  the  six  fundamental  questions  presented  earlier  must  be  studied  and  answered. 

Now  there  is  a further  point  which  may  be,  in  the  future,  of  fundamental  interest.  In  our  opinion,  inverse- 
problem  studies  should  be  used  not  only  to  explain  experiments  but  also  to  plan  them.  For  a given  model,  inverse 
methods  can  be  used  in  designing  experiments  in  a way  that  more  information  can  be  extracted  from  fewer  experi- 
ments. When  this  is  done,  the  studies  of  inverse  problems  will  be  the  most  important  tool  in  scientific  research. 
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DISCUSSION 


Shm.ys:  What  kind  of  bias  are  we  putting  in  when  we  restrict  the  class  of  potentials?  It  is  net  clear  whether  the 

Backus-Gilbert  method,  which  appears  to  remove  some  of  that  bias  or  formalize  the  procedure,  is  the  best. 
Whatever  procedure  is  used,  there  is  always  the  question  of  somehow  smoothing  the  information.  We  either 
restrict  the  class  of  potentials  which  we  are  trying  to  construct  or  we  smooth  the  data  we  obtain  immediately 
and  proceed  analytically.  This  restriction  must  occur,  and  it  is  not  obvious  which  end  to  put  it  in,  at  the  data 
end  or  class  of  potential  end. 

Sabatier:  In  any  case  you  have  to  put  a priori  constraints  on  the  class  of  dynamical  functions  you  are  looking  for. 

If  you  know  what  constraints  you  are  making  you  are  then  OK. 

Unidentified  speaker:  There  is  one  aspect  deserving  more  emphasis,  which  is  illustrated  by  the  Backus-Gilbert 

procedure.  We  should  concentrate  on  how  much  information  is  contained  in  the  data  supplied  by  the  physi 
cist.  These  con  stitute  measured  values,  including  errors  contained,  in  lighi  of  what  we  know  a priori  about 
dynamical  variables.  Then  we  should  determine  what  this  implies  about  the  dynamical  variables.  The  impor- 
tant question  is,  how  much  work  do  we  have  to  perform  to  extract  that  information?  That  subject  is  in  its 
infancy.  The  astonishing  thing  about  Backus-Gilbert  is  that  they  get  throujh  with  a feasible  amount  of 
computation. 

Sabatier:  I agree. 

Parker:  In  the  situation  where  you  have  your  complete  solution  to  the  quantum  mechanical  scattering  problem 

how  do  you  describe  the  content  of  the  physical  measurements? 

Sabatier:  This  deserves  much  study.  Your  question  is  the  problem  of  stability.  If  the  inverse  mappings  are  con- 

tinuous, and  if  they  are  differentiable,  then  you  can  associate  to  a ball  in  the  st'ace  of  experiment  results,  a ball 
in  the  space  of  dynamical  functions.  Then,  if  you  take  experimental  results  with  only  1 5 points,  you  in  fact 
know  more  than  these  15  points.  In  the  Gelfand-Levitan  method  you  know  15  points.  But  from  the  a priori 
constraints  you  have  on  the  potential,  you  can  put  inequalities  on  all  the  other  values  you  don’t  know.  In  this 
case,  then,  you  also  have  an  infinite  set.  You  have  to  put  bounds  on  all  :he  things  you  don’t  know  and  on  the 
experimental  errors. 

Calogero:  At  what  point  one  should  smooth  the  results  is  still  the  question.  One  always  smooths  experimental 

data.  One  must  do  it  sensibly  stating  errors. 

Parry:  In  the  matter  of  cost  of  producing  data,  and  the  aims  set  down  in  the  whole  field  of  geoscience,  these 

measurements  are  extremely  expensive.  I would  like  to  add  another  goal:  We  not  only  want  the  physical 
relationships  and  the  information  content,  but  also  we  would  like  to  have  a unique  one-to-one  correspond- 
ence between  the  physical  parameters  and  the  inverted  parameter. 
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ABSTRACT  N73" 11630 

The  chronological  development  of  multidimensional  inversion  techniques  is  reviewed  briefly.  Some  current 
work  in  this  area  is  described. 

As  we  have  seen,  the  one-dimensional  inverse  problem  has  received  a great  dear  of  attention  ever  the  past  65 
years.  The  literature  is  tremendous  and  the  effort  has  extended  over  a great  many  areas  of  application  as  well  as  the 
basic  mathematic  procedures  involved.  By  comparison  the  extension  to  problems  in  more  than  one  dimension  has 
received  relatively  little  attention.  The  history  of  the  multidimensional  inverse  problem  differs  considerably  from 
that  of  the  one-dimensional  one.  In  the  one-dimensional  problem,  “classical  ’ methods  came  first,  various  numercal 
and  modeling  procedures  were  being  developed  simultaneously  with  perturbation  methods  and  other  approximate 
procedures  for  the  inverse  “wave”  problem;  finally,  exact  methods  for  the  inversion  of  the  wave  problem  were 
devised. 

The  multidimensional  inverse  problem  developed  in  reverse  order.  Exact  inversion  methods  have  been  knewn 
for  almost  20  years  [Moses,  1956! . Classical  methods  using  variation  of  parameters  or  perturbations  have  been  used 
in  recent  years,  but  relatively  sparsely  [Cain  et  a/.,  1966;  Gross  and  Pirragim,  197 1 ] . There  is  as  yet  no  systematic 
procedure  for  classical  inversion  of  a multidimensional  scattering  problem.  One  might  ask.  why  bother  with  classical 
methods  if  wave  methods  are  available?  First,  the  available  exact  wave  methods  appear  to  depend  on  Born-type 
expansions  and  may  therefore  converge  slowly  when  the  phase  shifts  are  large  and  strongly  dependent  on  frequency 
and  direction  of  incidence  due  to  large  extent  of  the  unknown  medium.  Second,  classical  formulation  generally 
results  in  a simpler  problem;  this,  at  least  is  the  case  in  the  one -dimensional  inverse  problem.  The  latter  consideration 
may  be  rephrased  to  say  that  the  classical  problem  requires  a different  type  of  data,  a type  that  is  sometimes  easier  to 
obtain  experimentally.  For  example,  the  time  delay  of  a wave  packet  may  be  easier  to  measure  than  the  phase  shift 
of  the  carrier. 

In  their  impoitant  paper,  I war  a and  Nagata  [1970]  present  a perturbation  treatment  of  the  eikonal  equation. 
The  resulting  system  of  equations,  each  equation  containing  terms  involving  the  solutions  of  all  the  preceding  equa- 
tions, can  be  solved.  Unfortunately,  rather  than  expand  each  perturbation  in  a complete  set  of  functions,  they 
assume  a finite  series,  and  the  coefficients  are  obtained  from  a finite  set  of  observations.  The  method  can  probably 
be  carried  further,  and  conditions  on  the  existence  of  solutions  and  convergence  of  the  series  should  be  established. 

Lavrentiev  et  al.  [1970]  have  provided  a fairly  systematic  presentation  of  multidimensional  inverse  problems. 
The  stepping  stone  for  their  monograph  is  the  determination  of  a function  of  n variables  from  the  values  of  its 
mean  over  a set  surfaces.  The  surfaces  constitute  an  n-parameter  family,  and  the  mean  must  be  known  as  a function 
of  all  the  parameters.  This  basic  inversion  problem  in  integral  geometry  is  treated  carefully,  with  conditions  estab- 
lished for  the  unknown  function  as  well  as  for  the  set  of  surfaces.  Various  inverse  problems  of  partial  differential 
equations  are  reduced  to  this  basic  problem  of  integral  geometry.  In  all  the  problems  treated,  the  spatial  domain  of 
the  partial  differential  equation  is  the  half  space,  and  the  data  available  are  assumed  to  be  the  response  to  an  impulse 
source  located  at  a point  on  the  boundary  as  observed  at  another  point  on  the  boundary. 
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Lavrentiev  et  aL  deal  with  the  Klein-Gordon  type  equation,  with  variable  coefficients,  giving  results  that  would 
be  applicable  directly  to  plasma  diagnostics.  The  unknown  coefficient  function  is  taken  to  be  a perturbation  of  a 
constant.  This  perturbation  can  be  calculated  from  the  response  as  a function  of  the  coordinates  of  the  source  and 
observation  points. 

In  dealing  with  the  nondispersive  scalar  wave  equation  with  variable  refractive  index  is  delt  with,  Lavrentiev  et 
al.  immediately  restrict  the  information  required  to  the  classical  variable,  time  of  arrival  of  the  wave  front.  Again 
the  problem  is  considered  as  a perturbation  problem,  this  time  of  a one-dimensional  index  variation  which  is  assumed 
known.  The  time  delay  as  a function  of  source  and  observation  point  locations  can  be  inverted  to  yield  the  perturba- 
tion in  refractive  index.  Other  applications  of  this  approach  are  given. 

Clearly,  while  the  multidimensional  inverse  problem  is  finally  receiving  some  attention,  we  are  not  yet  at  the 
end  of  the  road.  The  utilization  of  temporal  response  to  an  impulse  both  in  the  multidimensional  as  well  as  in  the 
one-dimensional  case  has  interesting  possibilities.  Perturbation  procedures  might  be  useful  if  they  can  be  shown  to 
be  sufficiently  rapidly  convergent  for  reasonable  amplitudes  of  the  perturbation. 
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